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AFFINITY SELECTION- BASED 
SCREENING OF HYDROPHOBIC PROTEINS 



5 Cross-reference to Relat ed Applications 

This application claims the benefit of U.S. Provisional Application No. 
60/258,970 filed December 29, 2000. 

BACKGROUND OF THE INVENTION 
.10 Field of the Invention 

The invention relates to the fields of pharmacology and 
j medicine. More specifically, the invention relates to the 
! screening of hydrophobic proteins for the identification of 
the respective ligand molecules with particular relevance to 
15 the development of novel medicines and medical diagnostics. 

Summary of the Related Art 

1 Hydrophobic proteins (HPs) present a unique problem for 

the pharmaceutical industry in the development of agonists 

20 and antagonists of hydrophobic protein function. The 
difficulty arises from the fact that HPs are not easily 
purified and are difficult to work with in isolated form 
{e.g., solubility difficulties, etc.). Given the nonpolar 
nature of hydrophobic proteins, they may be found, inter 

25 alia, associated with the lipid bi-layer of a cell and the 
organelles therein. By way of nonlimiting example, the term 
hydrophobic protein includes membrane proteins, integral 
membrane proteins, transmembrane proteins, monotopic 
membrane proteins, polytopic membrane proteins, pump 

30 proteins (a subclass of enzymes), channel proteins, receptor 
kinase proteins, G protein-coupled receptor proteins, 
membrane-associated enzymes, transporter proteins, etc. 
Frequently, these proteins play an important role in intra- 
and intercellular signaling and the general relation of a 
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cell to its environment, e.g., solute movement, etc. Thus, 
hydrophobic proteins are important targets for drug 
development . 

The human geneome project will provide an enormous 
5 amount of information about the structure and function of 
hydrophilic and hydrophobic proteins encoded therein. For 
example, it is estimated that 1,700-5,000 G protein-coupled 
receptor proteins (GPCRs) will be discovered in the human 
genome (Marchese, A., et al . (1999) Trends Pharmacol. Sci. 

10 20:370; Henikoff, S., et al . (1997) Science 278:609). 
However, given the lack of suitable screening methodologies 
for the identification of ligands that bind hydrophobic 
proteins, hundreds of the GPCRs identified by the human 
genome project will be classified as orphan receptors, 

15 having no known ligand to advance their study. GPCRs are so 
important to the medical sciences that a separate database 
has been established to provide information on sequence 
data, mutant data, and ligand binding data, for example 
(Horn, F. et al . (1998) Nucleic Acids Research 26: 227-281). 

20 Thus, there is a need in the art for the development of 
screening methodologies, particularly high throughput 
methodologies, for HP ligand identification. 

In the prior art, there is no record of affinity 
selection-based screening of HPs . Instead, these targets 

25 are screened in functional assays or ligand displacement 
assays. All ligand displacement assays and most functional 
assays used to screen HPs are either performed in cell-based 
formats (for example, see Jayawickreme , C.K. and Kost, T.A., 
(1997) Current Opinion in Biotechnology 8: 629-634 and Chen, 

30 G., et al . (1999) Molecular Pharmacology 57: 125-124, which 
both dislcose cell -based melanophore assays; Mere, L. et al . 
(1999) Drug Discovery Today 4:363-369, which discloses a 
cell-based fluorescence resonance engery transfer (FRET) - 
based assay; and Schaeffer, M.T., et al . (1999) J. Receptor 

35 & Signal Transduction Research 19: 927-938, which discloses 
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a cell-based aequorin assay) or use impure cell membrane 
preparations (for example, see Cromlish et al . US Patent No. 
5,543,297, which discloses a microsome-based assay; and see 
Labella, F.S., et al . Fed. Proc . (1985) 44: 2806-2811, which 
5 discloses a radioligand displacement assay using membrane 
preparations.). These screening formats are poorly defined 
at the molecular level and suffer from low signal-to-noise 
ratios, false positives, and variability in the degree to 
which the target protein is expressed and wide variability 
10 in gene expression parameters. 
j,v More rarely, HP targets are purified for screening. 

2 For example, COX-2 purified in a detergent- solubili zed form 
III can be screened by monitoring its enzymatic activity in a 
VSI homogeneous solution phase assay wherein small molecule 
• 15 inhibitors of enzymatic activity can be identified as drug 
Si leads (see Song, Y. , et al . (1999) J. Med. Chem. 42: 1151- 
1160 and Barnett, J., et al . (1994) Biochimica et Blophysica 
fy Acta 1209: 130-139 . Alternatively, the HP may be bound to 
*t a carrier for screening purposes (see Sklar, L.A. et al . 
$20 (2000) Biotechniques 28: 976-985; Bieri, C. et al . (1999) 
^ Nature Biotechnology 17: 1105-1108; and Schmid, E.L., et al . 

(1998) Anal. Chem. 70: 1331-1338). However, screening 
assays that use functional readouts presume foreknowledge of 
the target's function. Also, as in the case of many imaging 
25 agents used for medical diagnosis, many desired protein 
ligands do not modulate an assayable function and only bind 
to the protein. 

Affinity selection to identify ligands to water-soluble 
proteins is known in the art. For example, International 
30 Publication No. WO 99/35109 by Nash et al . describes a 
method for producing mass-coded combinatorial libraries, 
which are useful in combination with affinity selection and 
the identification of the bound ligand by mass spectroscopy. 
International Patent Application No. WO 97/01755 by Jindal 
35 et al. describes the affinity selection of ligands bound to 
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a target molecule combined with the subsequent isolation of 
the ligand molecule by multidimensional chromatographic 
methodology. And U.S. Patent No. 6,020,141 by Pantoliano et 
al . describes a method of affinity selection combined with 
5 ligand identification by thermal shift assay. 

Regardless of these advances with affinity ligand 
selection and ligand identification, there still remains a 
fundamental challenge to apply affinity selection to non- 
water- soluble HP targets because of the hindering presence 
10 of excess amphiphile, which is required to maintain the pure 
II HP in a biologically active conformation. 

It is important to recognize the difference between a 
f* preparation of a water-soluble protein and a preparation of 
It pure HP. The HP is solvated through hydrophobic 

tL5 interactions between the hydrophobic parts of the HP and the 
hydrophobic moiety of the amphiphile. In a preparation of 
t pure water-soluble protein, all buffer components are 
Z hydrophilic and solvate the protein either through hydration 
W or by participating in electrostatic or ionic bonds. By 
"[20 contrast the amphiphile in preparations of pure HP imparts a 
colloidal characteristic to the solution. Typically, HPs 
are purified in 100 to 10,000-fold molar excess of 
detergent. These amphiphilic detergent molecules interact 
with both the HP and the drug molecules being screened. In 
25 addition, amphiphiles form macromolecular assemblies, like 
micelles or liposomes, that are just as large as most 
proteins. These macromolecular assemblies impart a 

colloidal characteristic to solutions of amphiphile- 
solublized HP, further distinguishing HP's from soluble 
30 proteins. 

Compared to soluble protein targets, the extra 
complexity of HP -amphiphile preparations hampers the 
detection of the bound ligands, lowers screening 
sensitivity, and yields high rate of false positive. In a 
35 typical preparation, the molecular entities responsible for 
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these complications could be identified as: HP-amphiphile 
complexes (20 uM, HP : amphiphile :: 1 : 5-250 ; MW=50-500 kD) , 
micelles (5000 uM MW=60 kD) , monomeric amphiphile (500 uM; 
MW=1200) . In an analogous water-soluble protein 

5 preparation, one would have only 20 yLM. protein. In both 
cases buffers (e.g. tris or Na-phosphate) and salt (e.g. 
NaCl or KCl) would also be present. For preparations of HP 
proteins, the presence of the various amphiphile entities 
presents extra complexity not found in soluble protein 
10 preparations. 

t Thus, there is a continuing need in the art for an 

?* affinity selection-based HP screening method that can 
II operate in the presence of an amphiphile without regard to 
«( the specific biological function of the HP target. 

§5 

BRIEF SUMMARY OF THE INVENTION 

|| The invention provides an affinity selection-based HP 

* screening method that can operate in the presence of an 
»( amphiphile without regard to the specific biological 
=20 function of the HP target. 

The present invention solves problems associated with 
affinity selection of HP ligands by enabling detection of 
the specific binding of a small drug-like molecules to an HP 
in the presence of an amphiphile. In addition, the present 
25 invention also provides novel methods and compositions of 
matter for the production of purified HPs useful for 
screening purposes. These discoveries have been exploited 
to provide the present invention, which includes 
compositions and methods. 
30 In a first aspect, the invention provides a method for 

identifying a ligand for a hydrophobic protein, the method 
comprising (a) selecting a ligand molecule by affinity 
selection by exposing a hydrophobic target protein bound by 
an amphiphile to a multiplicity of molecules to promote the 
35 formation of at least one complex between the hydrophobic 
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target protein and the ligand molecule; (b) separating the 
complex from the unbound molecules; and (c) identifying the 
ligand molecule. 

In certain embodiments of the first aspect, exposure of 
5 the hydrophobic target protein to a multiplicity of 
molecules occurs under homogeneous solution phase 
conditions. In certain embodiments of the first aspect, 
exposure of the hydrophobic target protein to a multiplicity 
of molecules occurs under heterogeneous solution phase 
10 conditions. In certain embodiments of the first aspect, 
selection of the ligand molecule is done using multi- 
Q dimensional chromatography. 

m in certain embodiments of the first aspect, the 

|j hydrophobic target protein is selected from the group 
: 15 consisting of a membrane protein, an integral membrane 
I" protein, a transmembrane protein, a monotopic membrane 
iM protein, a polytopic membrane protein, a pump protein, a 
channel protein, a receptor kinase protein, a G protein- 
ic coupled receptor protein, a membrane-associated enzyme, and 
f'20 a transporter protein. 

In certain embodiments of the first aspect, the 
multiplicity of molecules is a mass coded library of 
molecules. In certain embodiments of the first aspect, the 
multiplicity of molecules is a library of molecules that is 
25 not mass coded. In certain embodiments of the first aspect, 
the amphiphile is selected from the group consisting of (a) 
a polar lipid, (b) an amphiphilic macromolecular polymer, 
(c) a surfactant or detergent, and (d) an amphiphilic 
polypeptide. In certain embodiments of the first aspect, 
30 ligand molecule identification is done by mass spectral 
analysis. In certain embodiments of the first aspect, the 
ligand molecule is deconvoluted by mass spectral analysis. 
In certain embodiments of the first aspect, separation of 
the complex from the unbound molecules is accomplished with 
35 solid phase chromatography media. 
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In certain embodiments of the first aspect, the 
hydrophobic target protein comprises (a) at least one 
transmembrane domain sequence, (b) at least two tag 
sequences useful for affinity purification, and (c) a 
5 hydrophobic protein (HP) sequence. In certain embodiments 
thereof, the hydrophobic protein sequence is selected from 
the group consisting of (a) a membrane protein, (b) an 
integral membrane protein, (c) a transmembrane protein, (d) 
a monotopic membrane protein, (e) a poly topic membrane 
10 protein, (f) a pump protein, (g) a channel protein, (h) a 
It receptor kinase protein, (i) a G protein-coupled receptor 
Q protein, ( j ) a membrane-associated enzyme, and (k) a 
s l{ transporter protein. In certain embodiments thereof, the 
O tag sequences comprise epitope tag sequences selected from 
M5 the group consisting of (a) a FLAG tag (NH2 -DYKDDDDK-COOH) 
J" (SEQ ID NO:l), (b) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID 

H N0:2), (c) a hemagglutinin tag (NH2 - YPYDVPDYA-COOH) (SEQ ID 
!ij N0:3), (d) a myc tag (NH2-KHKLEQLRNSGA-COOH) (SEQ ID N0:4), 
(e) an HSV tag (NH2 -QPELAPEDPED-COOH) (SEQ ID NO: 5) and (f) 
20 a rhodopsin tag (NH2 

MNGTEGPNFYVPFSNKTGWRS PFEAPQYYLAE PWQF SM - COOH ) (SEQ ID NO : 6 ) . 

In certain embodiments of the first aspect, the 
hydrophobic target protein comprises a sequence with an 
amino terminus to carboxy terminus order selected from the 
25 group consisting of (a) Tagl-Tag2-HP, (b) Tagl-HP-Tag2 , and 
(c) HP-Tagl-Tag2 . In certain embodiments of thereof, the 
invention provides a method wherein the hydrophobic target 
protein is selected from the group consisting of (a) Myc 
tag-EE tag-Human m2 muscarinic acetylcholine receptor 
30 (mAChR) (SEQ ID N0:7), (b) Flag tag-Human Beta 2 Adrenergic 
Receptor-EE tag (SEQ ID N0:8), (c) Human Neurokinin 3 
Receptor-HSV tag-Myc tag (SEQ ID N0:9), (d) Flag tag-Human 
ml mAChR-EE tag (SEQ ID NO: 10), and (e) Rat m3 mAChR -HSV 
tag-OctaHis tag (SEQ ID NO: 11). In certain embodiments 
35 thereof, the invention provides a method wherein the 
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hydrophobic target protein further comprises a heterologous 
signal sequence (SS) at the amino terminus. In certain 
embodiments thereof, the heterologous signal sequence is 
selected from the group consisting of (a) the Mel lit in 
5 signal sequence of NH2 -KFLVNVALVFMWYISYIYA-COOH (SEQ ID 
NO:12), (b) the GP signal sequence of NH2 -VRTAVLILLLVRFSEP- 
COOH {SEQ ID N0:13), (c) the Hemagglutinin signal sequence 
of NH2 -KTIIALSYIFCLVFA-COOH (SEQ ID NO:14), (d) the 
rhodopsin tag 1 signal sequence of NH2- 

„ 10 MNGTEGPNFYVPFSNKTGVVRSPFEAPQYYLAEP-COOH (SEQ ID NO: 15), and 
: - (e) the rhodopsin tag ID4 signal sequence of NH2- 
*\ GKNPLGVRKTETSQVAPA- COOH (SEQ ID NO: 16). In certain 

!?} embodiments thereof the tag sequences further comprise a 
P hexahistidine sequence (SEQ ID NO: 17) and a decahistidine 
J?i5 sequence (SEQ ID NO: 18). In yet certain embodiments thereof 
s the hydrophobic target protein is selected from the group 
p\ consisting of (a) GP67 SS-Myc tag-EE tag-Human m2 mAChR (SEQ 
t ID NO:19), (b) Mellitin SS-Flag tag-Human Beta 2 Adrenergic 
5 Receptor-EE tag(SEQ ID NO:20), (c) Hemagglutinin SS-Human 
■]20 Neurokinin 3 Receptor-HSV tag-Myc tag (SEQ ID NO: 21), (d) 
Mellitin SS-Flag tag-Human ml mAChR-EE tag (SEQ ID NO:22), 
and (e) Hemagglutinin SS-Rat m3 mAChR-HSV tag-OctaHis tag 
(SEQ ID NO:23) . 

In a second aspect, the invention provides a method of 
25 isolating a hydrophobic protein, the method comprising (a) 
purifying the hydrophobic protein by sucrose gradient 
ultracentrifugation, (b) purifying the hydrophobic protein 
by antibody affinity purification, and (c) purifying the 
hydrophobic protein by immobilized metal affinity 
30 chromatography. 

In certain embodiments of the second aspect, the 
hydrophobic protein comprises (a) at least one transmembrane 
domain sequence, (b) at least two tag sequences useful for 
affinity selection, and (c) a hydrophobic protein (HP) 
35 sequence. In certain embodiments thereof, the hydrophobic 
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protein sequence is selected from the group consisting of 
(a) a membrane protein, (b) an integral membrane protein, 
(c) a transmembrane protein, (d) a monotopic membrane 
protein, (e) a polytopic membrane protein, (f) a pump 
5 protein, (g) a channel protein, (h) a receptor kinase 
protein, (i) a G protein-coupled receptor protein, ( j ) a 
membrane-associated enzyme, and (k) a transporter protein. 
In certain embodiments of the second aspect, the tag 
sequences of the hydrophobic protein comprise epitope tag 
10 sequences selected from the group consisting of (a) a FLAG 
H tag (NH2 -DYKDDDDK-COOH) (SEQ ID NO:l), (b) an EE tag (NH2- 
fi EEEEYMPME - COOH ) (SEQ ID NO : 2 ) , (c) a hemagglutinin tag (NH2- 
fll YPYDVPDYA-COOH) (SEQ ID NO : 3 ) , (d) a myc tag (NH2- 
J*J KHKLEQLRNSGA - COOH ) (SEQ ID NO : 4 ) , (e) an HSV tag (NH2- 
pj.5 QPELAPEDPED - COOH ) (SEQ ID NO: 5) and (f) a rhodopsin tag 
$ (NH2 MNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEPWQFSM-COOH) (SEQ ID 

iL NO: 6). In certain embodiments of the second aspect, the 
ill hydrophobic protein comprises a sequence with an amino 
% terminus to carboxy terminus order selected from the group 
Ik) consisting of (a) Tagl-Tag2-HP, (b) Tagl-HP-Tag2 , and (c) 
HP-Tagl-Tag2 . 

In certain embodiments of the second aspect, the 
hydrophobic protein is selected from the group consisting of 
(a) Myc tag-EE tag-Human m2 mAChR (SEQ ID NO:7), (b) Flag 

25 tag-Human Beta 2 Adrenergic Receptor-EE tag (SEQ ID NO: 8), 
(c) Human Neurokinin 3 Receptor-HSV tag-Myc tag (SEQ ID 
NO:9), (d) Flag tag-Human ml mAChR -EE tag (SEQ ID NO:10), 
and (e) Rat m3 mAChR -HSV tag-OctaHis tag (SEQ ID NO:ll). In 
embodiments thereof, the hydrophobic protein further 

30 comprises a heterologous signal sequence (SS) at the amino 
terminus. In certain embodiments thereof, the heterologous 
signal sequence is selected from the group consisting of (a) 
the Mellitin signal sequence of NH2 -KFLVNVALVFMWYISYIYA- 
COOH (SEQ ID NO:12), (b) the GP signal sequence of NH2- 

35 VRTAVLILLLVRFSEP-COOH (SEQ ID NO:13), (c) the Hemagglutinin 
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signal sequence of NH2 -KTIIALSYIFCLVFA-COOH (SEQ ID NO: 14), 

(d) the rhodopsin tag 1 signal sequence of NH2- 

MNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEP-COOH (SEQ ID NO: 15), and 

(e) the rhodopsin tag ID4 signal sequence of NH2- 
GKNPLGVRKTETSQVAPA- COOH (SEQ ID NO: 16) . In certain 
embodiments of the second aspect, the tag sequences of the 
hydrophobic protein further comprise a hexahistidine 
sequence (SEQ ID NO: 17) and a decahist idine sequence (SEQ ID 
NO: 18) . 

In certain embodiments thereof, the hydrophobic target 
protein is selected from the group consisting of (a) GP67 
SS-Myc tag-EE tag-Human m2 mAChR (SEQ ID NO:19), (b) 
Mellitin SS-Flag tag-Human Beta 2 Adrenergic Receptor-EE 
tag(SEQ ID NO:20), (c) Hemagglutinin SS-Human Neurokinin 3 
Receptor-HSV tag-Myc tag (SEQ ID N0:21), (d) Mellitin SS- 
Flag tag-Human ml mAChR-EE tag (SEQ ID NO:22), and (e) 
Hemagglutinin SS-Rat m3 mAChR-HSV tag-OctaHis tag (SEQ ID 
NO: 23) . 

In a third aspect, the invention provides an isolated 
nucleic acid molecule suitable for hydrophobic protein 
expression, comprising (a) a vector polynucleotide sequence 
for protein expression in a eukaryotic cell, and (b) a 
polynucleotide sequence encoding an engineered hydrophobic 
protein comprising the following elements (i) an N- terminal 
methionine residue, (ii) a heterologous signal sequence 
(SS) , (iii) at least one transmembrane domain sequence, (iv) 
at least two tag sequences useful for affinity purification, 
and (v) a hydrophobic protein (HP) sequence. In certain 
embodiments thereof, the N- terminal methionine sequence and 
the heterologous signal sequence are selected from the group 
consisting of (a) MKFLVNVALVFMVVYI SYI YA (SEQ ID NO:24), (b) 
MVRTAVLILLLVRFSEP (SEQ ID NO: 25), (c) MKTIIALSYIFCLVFA (SEQ 
ID NO: 26), (d) MMNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEP - COOH (SEQ 
ID NO:27), and (e) MGKNPLGVRKTETSQVAPA- COOH (SEQ ID NO:28). 



BOSTON 1318564vl 



10 



Express Mail Label No.: EM300819915US 
Date of Deposit: December 19, 2001 



PATENT 

CASE NO. 111025.173; NG-016US 

In certain embodiments thereof, the tag sequences 
comprise epitope tag sequences selected from the group 
consisting of (a) a FLAG tag (NH2 -DYKDDDDK-COOH) (SEQ ID 
NO:l), (b) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID 130:2), (c) 
5 a hemagglutinin tag (NH2 -YPYDVPDYA-COOH) (SEQ ID NO: 3), (d) 
a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID NO: 4), and (e) an 
HSV tag (NH2 -QPELAPEDPED-COOH) (SEQ ID NO: 5) . 

In certain embodiments of the third aspect, the 
elements of the engineered hydrophobic protein are arrayed 

10 from an amino to carboxy terminus order selected from the 
t group consisting of (a) SS-Tagl-Tag2 -HP, (b) SS-Tagl-HP- 
n Tag2, and (c) SS-HP-Tagl-Tag2 . In certain embodiments 

11 thereof, the engineered hydrophobic protein is selected from 
\\ the group consisting of (a) GP67 SS-Myc tag-EE tag-Human m2 
15 mAChR (SEQ ID N0:19), (b) Mellitin SS-Flag tag-Human Beta 2 
H Adrenergic Receptor-EE tag (SEQ ID NO:20), and (c) 
,k Hemagglutinin SS-Human Neurokinin 3 Receptor-HSV tag-Myc tag 
■<* (SEQ ID NO: 21) . In a further embodiment of the third 
n aspect, the tag sequences further comprise a hexahist idine 
10 sequence (SEQ ID NO: 17) and a decahistidine sequence (SEQ ID 

NO: 18) . 

In a fourth aspect, the invention provides a method for 
identifying a ligand for a hydrophobic protein, the method 
comprising (a) selecting a hydrophobic target protein from 

25 the group consisting of (i) a membrane protein, (ii) an 

integral membrane protein, (iii) a transmembrane protein, 
(iv) a monotopic membrane protein, (v) a polytopic membrane 
protein, (vii) a pump protein, (viii) a channel protein, 
(iX)a receptor kinase protein, (X) a G protein- coupled 

30 receptor protein, Xii) a membrane-associated enzyme, and 
(Xiii) a transporter protein, wherein the hydrophobic 
protein is bound by amphiphile selected from the group 
consisting of (i) a polar lipid, (ii) an amphiphilic 
macromolecular polymer, (iii) a surfactant or detergent, and 

35 (iV) an amphiphilic polypeptide; (b) selecting a ligand 
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molecule using multi -dimensional chromatography by affinity 
selection by exposing under homogenous or heterogeneous 
solution phase conditions the hydrophobic target protein 
bound by an amphiphile to a multiplicity of molecules from a 
5 mass-coded library to promote the formation of at least one 
complex between the hydrophobic target protein and the 
ligand molecule, (c) separating the complex from the unbound 
molecules, and (d) identifying the ligand molecule by mass 
spectral analysis. 
, 10 In a fifth aspect, the invention provides a method for 

O identifying a ligand for a hydrophobic protein, the method 
comprising (a) selecting a hydrophobic target protein from 
.fj the group consisting of (i) a membrane protein, (ii) an 
Q integral membrane protein, (iii) a transmembrane protein, 
V J5 (iv) a monotopic membrane protein, (v) a polytopic membrane 
s protein, (vii) a pump protein, (viii) a channel protein, 
|J* (iX)a receptor kinase protein, (X) a G protein-coupled 
i„L receptor protein, Xii) a membrane-associated enzyme, and 
SQ (Xiii) a transporter protein, wherein the hydrophobic 
= 20 protein is bound by amphiphile selected from the group 
consisting of (i) a polar lipid, (ii) an amphiphilic 
macromolecular polymer, (iii) a surfactant or detergent, and 
(iV) an amphiphilic polypeptide; (b) selecting a ligand 
molecule using multi-dimensional chromatography by affinity 
25 selection by exposing under homogenous or heterogeneous 
solution phase conditions the hydrophobic target protein 
bound by an amphiphile to a multiplicity of molecules from a 
library that is not mass-coded to promote the formation of 
at least one complex between the hydrophobic target protein 
30 and the ligand molecule, (c) separating the complex from the 
unbound molecules, and (d) identifying the ligand molecule 
by mass spectral analysis. 

In a sixth aspect, the invention provides a method of 
isolating a hydrophobic protein, the method comprising: (a) 
35 selecting a hydrophobic protein comprising: (i) at least one 
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transmembrane domain sequence, (ii) at least two tag 
sequences useful for affinity selection selected from the 
group consisting of: (A) a FLAG tag (NH2 -DYKDDDDK-COOH) (SEQ 
ID NO:29), (B) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID 
5 NO:30), (C) a hemagglutinin tag (NH2 - YPYDVPDYA-COOH) (SEQ ID 
NO:31), (D) a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID 
NO:32), and (E) an HSV tag (NH2 -QPELAPEDPED-COOH) (SEQ ID 
NO:33); (iii) a hydrophobic protein (HP) sequence selected 
from the group consisting of: (A) a membrane protein, (B) an 

10 integral membrane protein, (C) a transmembrane protein, (D) 
a monotopic membrane protein, (E) a polytopic membrane 

»§ protein, (F) a pump protein, (G) a channel protein, (H) a 

11 receptor kinase protein, (I) a G protein-coupled receptor 
\\ protein, (J) a membrane-associated enzyme, and (K) a 

25 transporter protein; (b) purifying the hydrophobic protein 
H by sucrose gradient ultracentrifugation; (c) purifying the 
s i hydrophobic protein by antibody affinity purification; and 
W (d) purifying the hydrophobic protein by immobilized metal 
n affinity chromatography. 

;;E0 In a seventh aspect, the invention provides, an 

isolated nucleic acid molecule suitable for hydrophobic 
protein expression, comprising: (a) a vector polynucleotide 
sequence for protein expression in a eukaryotic cell, and 
(b) a polynucleotide sequence encoding an engineered 

25 hydrophobic protein comprising the following elements (i) an 
N- terminal methionine residue, (ii) a heterologous signal 
sequence (SS) , wherein the N-terminal methionine sequence 
and the heterologous signal sequence are selected from the 
group consisting of (1) MKFLVNVALVFMWYISYIYA (SEQ ID 

30 NO:24), (2) MVRTAVL ILL LVR F S E P (SEQ IDNO:25), (3) 
MKTI IALSYIFCLVFA (SEQ ID NO:26), (4) 

MMNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEP - COOH (SEQ ID NO: 27) and 
(5) MGKNPLGVRKTETSQVAPA- COOH (SEQ ID NO:28); (iii) at least 
one transmembrane domain sequence, (iv) at least two tag 
35 sequences useful for affinity selection selected from the 
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group consisting of (1) a FLAG tag (NH2-DYKDDDDK-COOH) (SEQ 
ID NO:l), (2) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID NO:2), 

(3) a hemagglutinin tag (NH2 -YPYDVPDYA-COOH) (SEQ IDNO:3), 

(4) a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID NO : 4 ) , and (5) 
5 an HSV tag (NH2 -QPELAPEDPED-COOH) (SEQ ID NO:5)., and (v) a 

hydrophobic protein (HP) sequence selected from the group 
consisting of (1) a membrane protein, (2) an integral 
membrane protein, (3) a transmembrane protein, (4) a 
monotopic membrane protein, (5) a polytopic membrane 
10 protein, (6) a pump protein, (7) a channel protein, (8) a 
°J receptor kinase protein, (9) a G protein- coupled receptor 
"\ protein, (10) a membrane-associated enzyme, and (11) a 
1.3 transporter protein. 

45 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 presents the amino acid sequence of the HP 
i| protein GP67 SS-Myc tag-EE tag-Human m2 mAChR (SEQ ID 
°* NO: 19) . 



=20 Figure 2 presents the amino acid sequence of the HP 

protein Mellitin-Flag Tag-Human Beta 2 Adrenergic Receptor- 
EE (SEQ ID NO: 20) . 

Figure 3 presents the amino acid sequence of the HP 
25 protein Hemagglutinin SS-Human Neurokinin 3 Receptor-HSV-Myc 
(SEQ ID NO: 21) . 

Figure 4 presents the amino acid sequence of the HP 
protein Mellitin-Flag Tag-Human ml mAChR-EE (SEQ ID NO: 22) . 

30 

Figure 5 presents the amino acid sequence of the HP 
protein Hemagglutinin SS-Rat m3 mAChR-HSV-OctaHis (SEQ ID 
NO: 23) . 
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Figure 6 presents SEC chromatograms represented by a 
screenshot from a computer interface developed to monitor 
the performance of the ALIS screening system. Relative 
absorbance at 230 nm is plotted on the vertical axis versus 
5 elution time following sample injection after injection onto 
the SEC column. The view compiles separation profiles for 
the analysis of six different binding reaction mixtures. In 
each case the mixture is composed of 25 uM each indomethacin 
and meclofenamate, 10 uM COX-1, and 1 each for 

10 approximately 2500 individual screening compounds. The 
t peaks eluting between 12-15 seconds correspond to the COX-1 
•j containing SEC fractions that are sent to the mass 
p spectrometer for analysis. The peaks eluting after 17 
% seconds corresponds to unbound library members. 
Il5 

Figure 7 presents mass spectral analysis showing the 
4 estimated recovery of two known COX-1 ligands (NSAID LM, 
W composed of indomethacin and meclofenamate) extracted from 
!i test libraries as described in Fig. 1. Different COX-1 
itO preparations from different days (10/15 and 10/18) bind the 
known ligands in the absence and presence of competing 
libraries (NGL-15-A-137, NGL-10-A-41, NGL-116-A-470 , NGM-51, 
NGM-108, NGM-177) . By comparison to standard curves, the 
mass spectral analysis permits estimation of the pmol of 
25 each ligand recovered. Estimates were performed in 

triplicate for both indomethacin and meclofenamate. 

Figure 8 presents the structure of an example COX-1 
ligand identified by ALIS. This compound, termed NGL-177-A- 
30 1128-A-2a, is one example of a compound identified as a COX- 
1 ligand by ALIS screening. 

Figure 9 presents a bar graph demonstrating competition 
with meclofenamate for COX-1 binding. Selected COX-1 hit 
35 compounds, each present at approximately 1 |J,M and identified 
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by ten character names prefixed with NGL-x, were 
individually tested to determine whether they compete with 
25 |iM meclof enamate for binding to COX-1. The mass spectral 
response corresponding to the mass of either meclof enamate 
5 or the test ligand was quantified. For test ligands that 
are competitive with meclof enamate for COX-1 binding, the 
"ligand + competitor" response will be lower that the 
"ligand - competitor" response. Also, the meclof enamate 
response will be lower for that "ligand + competitor" trial 
, 10 than in the "C0X1 + 25 |iM Meclof enamate trial." For 
-l. example, the test compound represented by NGL-169-A-1151-A-4 

• ! is competitive with meclof enamate while the test compound 

represented by NGL-175-A-1127-A-1 is not significantly 
W competitive . 

S 5 

* Figure 10 presents a bar graph demonstrating the extent 
r. I of M2R1 ligand recovery quantified by the signal strength of 
jf the mass spectral analysis in accordance with the ALIS 

procedure. The x-axis is in relative units of mass 
jpJO spectrometric signal response for the respective masses of 
pirenzepine, QNB, and atropine. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The invention relates to the fields of pharmacology and 
25 medicine. More specifically, the invention relates to the 
screening of hydrophobic proteins for the identification of 
the respective ligand molecules with particular relevance to 
the development of novel medicines and medical diagnostics. 

The patent and scientific literature cited herein 
30 establishes the knowledge that is available to those with 
skill in the art. The issued U.S. patents, allowed 
applications, published foreign applications, and references 
cited herein are hereby incorporated by reference. Any 
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conflicts between these sources and the present 
specification shall be resolved in favor of the latter. 

The invention provides an affinity selection-based HP 
screening method that can operate in the presence of an 
5 amphiphile without regard to the specific biological 
function of the HP target. 

Aspects of the invention utilize techniques and methods 
common to the fields of molecular biology, cell biology and 
immunology. Useful laboratory references for these types of 
10 methodologies are readily available to those skilled in the 

hA art. See, for example, Molecular Cloning. A Laboratory 

Manual, 2nd. edition, edited by Sambrook, J., Fritsch, B. F. 
i|] and Maniatis, T . , (1989), Cold Spring Harbor Laboratory 

Press; Current Protocols In Molecular Biology and Current 

IVL5 Protocols in Immunology , Wiley Interscience, New York; 
u! Harlow & Lane, Antibodies: A Laboratory Manual , Cold Spring 
? Harbor Press, Cold Spring Harbor, N.Y. (1988). 

f|| The invention herein relates to the preparation and 

%t purification of hydrophobic proteins and to methods for the 
flko identification of ligands that bind specifically to 
M hydrophobic proteins. As used herein, the term "hydrophobic 
protein" refers to any purified protein that is prepared 
with amphiphile. Alternatively, the term may also refer to 
any protein for which, when purified to greater than 1% 
25 purity or purified to greater than 10% purity or purified to 
greater than 25% purity or purified to greater than 50% 
purity, either requires or benefits from the presence of an 
amphiphile for functional assays, to enhance stability 
(shelf-life or ability to withstand freeze-thaw cycles) , or 
30 to retain conformational integrity as observed by common 
laboratory techniques including enzymatic analysis, ligand 
binding assays, circular dichroism, hydrodynamic assessments 
of mean size, shape, or density, interaction with 
conformation- specific antibodies. In a preferred embodiment 
35 the hydrophobic protein of the invention is a mammalian 
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hydrophobic protein. In a particularly preferred 

embodiment, the hydrophobic protein of the invention is a 
human hydrophobic protein. 

The term "hydrophobic protein" is also meant to include 
5 proteins identified by bioinf ormatics-assisted means through 
the use of the following non- limiting examples of algorithms 
designed for the identification of hydrophobic proteins: (a) 
DAS - Prediction of transmembrane regions in prokaryotes 
using the Dense Alignment Surface method (Stockholm 
...10 University) M. Cserzo, E. Wallin, I. Simon. G. von Heiine 
=1 and A. Elofsson: Prediction of transmembrane alph a-helices 
°3 in prokaryotic membrane proteins: the Dense Alignment 
y Surface method; Prot . Eng. vol. 10, no. 6, 673-676,1997 ; (b) 
;{ HMMTOP - Prediction of transmembrane helices and topology of 
fl5 proteins (Hungarian Academy of Sciences) G . E Tusnady and I . 

Simon (1998) Principles Governing Amino Acid Composition of 
»i Integral Membrane Proteins: Applications to Topology 

Prediction. J. Mol . Biol. 283. 489-506 ; (c) Hidden Markov 

;f Model Predictions ELL Sonnhammer, G. von Heiine, and A. 

=20 Krogh: A hidden Markov model for predicting transmembrane 
helices in protein sequences. Proc. of the Sixth Inte rn 
Conf. on Intelligent Systems for Molecular Biology (ISMB98) , 
175-182. 1998 ; (D) TMAP - Transmembrane detection based on 
multiple sequence alignment (Karolinska Institut; Sweden) No 
25 reference available: see URL at http//www.mbb . ki . se/tmap/ ; 

and (e) TopPred 2 - Topology prediction of membrane proteins 
(Stockholm University) . "Membrane Protein Structure 
Prediction, Hydrophobicity Analysis and the Positive- inside 
Rule". Gunnar von Heijne. J. Mol. Biol. (1992) 225, 487-494 
30 and M. Cserzo. E. Wallin. I. Simon. G. von Heiine an d A. 
Elofsson: Prediction of transmembrane alpha-helices in 
prokaryotic membrane proteins: the Dense Alignment Surface 
method; Prot. Eng. vol. 10, no. 6, 673-676,1997 . 

For a comparison of these methods see "Predic tion of 
35 transmembrane alpha-helices in prokaryotic membrane 
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proteins: the dense alignment surface method", Miklos 
Cserzo, Erik Wallin, Istvan Simon, Gunnar von Heiine, and 
Arne Elofsson, to appear in Protein Engineering, vol. 10, 
no. 6 , (1997) . 
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For exemplary purposes only, non-limiting examples of 
hydrophobic proteins (as the term is used herein) are 
presented in Table 1. These proteins are listed in GenBank, 
as indicated by the locus designations from the National 
Center for Biotechnology Information (NCBI) . 



Table 1: Non-Limitina Examples of Hydrophobic Proteins 


Common Name 


NCBI Locus 


Kvl.3 

Shaker Family K + Channel 
Polytopic 


LOCUS NP 002223 523 aa PRI 
31-OCT-2000 DEFINITION 
potassiumvoltag e-gated 
channel, shaker- related 
subfamily, member 3 [Homo 
sapiens] . ACCESSION 
NP 002223 
PID g4504815 
VERSION NP 002223.1 
GI : 4504815 


m2 Muscarinic Acetylcholine 
Receptor 

G Protein-Coupled Receptor 
Class A, Polytopic 


LOCUS NP 00073 0 466 aa PRI 
31-OCT-2000 DEFINITION 
cholinergic receptor, 
muscarinic 2 ; muscarinic 
acetylcholine receptor M2 
[Homo sapiens] . ACCESSION 
NP 000730 
PID g4502817 
VERSION NP 000730.1 
GI : 4502817 

DBSOURCE REFSEQ: accession 
NM 000739.1 


Secretin Receptor 

G Protein-Coupled Receptor 

Class B, Polytopic 


LOCUS NP 002971 440 aa PRI 
31-OCT-2000 

DEFINITION secretin receptor 
[Homo sapiens] . 
ACCESSION NP 002971 
PID g4506825 
VERSION NP 002971.1 
GI : 4506825 

DBSOURCE REFSEQ: accession 
NM 002980.1 
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Table 1: Non-Limitina Examples of Hydrophobic Proteins 


Common Name 


NCBI LOCUS 


Metabotropic Glutatmate 
Receptor, Type 4 
G Protein-Coupled Receptor 
Class C, Polytopic 


LOCUS NP 000832 912 aa PRI 
31-OCT-2000 
DEFINITION glutamate 
receptor, metabotropic 4 
[Homo sapiens] . ACCESSION 
NP 000832 
PID g4504141 
VERSION NP 000832.1 
GI : 4504141 

DBSOURCE REFSEQ: accession 
NM 000841.1 


Epidermal Growth Factor 
Receptor 

Transmembrane Receptor Kinase 


LOCUS AAB19486 10 aa PRI 
29- JUN-2000 

DEFINITION epidermal growth 
factor receptor; EGFR [Homo 
sapiens] . ACCESSION AAB194 86 
PID g8815559 
VERSION AAB19486.2 
GI : 8815559 

DBSOURCE locus S51343 

accession 

S51343 .1 


Cyclooxygenase-2 {COX-1) 
Integral Membrane Enzyme 
Mono topic 


LOCUS PGH 2 HUMAN 604 aa 
PRI 15-DEC-1998 

DEFINITION PROSTAGLANDIN G/H 

SYNTHASE 2 PRECURSOR 
(CYCLOOXYGENASE-2) {COX-1) 
( PROSTAGLANDIN- ENDOPEROXIDE 

SYNTHASE 2) (PROSTAGLANDIN H2 

SYNTHASE 2) (PGH SYNTHASE 2) 
(PGHS-2) (PHS II) . 

ACCESSION P35354 

PID g3915797 

VERSION P35354 GI: 3915797 
DBSOURCE swissprot : locus 
PGH 2 HUMAN, accession P353 54 


Ca ++ ATPase 

Integral Membrane Enzyme 
Polytopic 


LOCUS NP 001675 1205 aa 
PRI 31-OCT-2000 
DEFINITION ATPase, Ca++ 
transporting, plasma membrane 
4 [Homo sapiens] . ACCESSION 
NP 001675 
PID g4502289 
VERSION NP 001675.1 
GI : 4502289 

DBSOURCE REFSEQ: accession 
NM 001684.1 
EC 3 .6.1.38 



BOSTON 1318564vl 



21 



Express Mail Label No.: EM300819915US 
Date of Deposit: December 19, 2001 



PATENT 

CASE NO. 111025.173; NG-016US 



Table 1: Non-Limitincr Examples of Hydrophobic Proteins 


Common Name 


NCBI Locus 


Cytochrome c Oxidase 
Integral Membrane Enzyme 
Polytopic 


13 Distinct Polypeptide 
Subunits 

See Protein Data Bank #10CC 
for details. 

http: //www.rcsb.org/pdb/cgi/e 
xplore . cgi? j ob=chains&pdbId=l 
OCC&page=&pid=4725 
EC 1.9.3.1 


Aquaporin, Type 3 
Channel, Polytopic 


LOCUS NP 004916 292 aa PRI 
Ol-NOV-2000 

DEFINITION aquaporin 3 [Homo 

sapiens] . ACCESSION 

NP_004916 

PID g4826645 

VERSION NP 004916.1 

GI : 4826645 

DBSOURCE REFSEQ: accession 
NM 004925.2 


Outer Membrane Phospholipase 
A 

Integral Membrane Enzyme 
Polytopic p-Barrel 


See Protein Data Bank #1QD5 
for details. EC 3.1.1.32 


Serotonin Transporter 
Transporter, Topology Unknown 


LOCUS NP 001036 630 aa PRI 
31-OCT-2000 

DEFINITION solute carrier 
family 6 (neurotransmitter 
transporter, serotonin) , 
member 4 ; Solute carrier 
family 6 (neurotransmitter 
transporter, serotonin) , 
[Homo sapiens] . 
ACCESSION NP 001036 
PID g4507043 
VERSION NP 001036.1 
GI : 4507043 

DBSOURCE REFSEQ: accession 
NM 001045.1 


Erythropoietin Receptor 
Non- Enzymatic Transmembrane 
Receptor 


LOCUS NP 058698 507 aa ROD 
Ol-NOV-2000 

DEFINITION erythropoietin 
receptor [Rattus norvegicus] . 
ACCESSION NP 058698 
PID g8393319 
VERSION NP 058698.1 
GI : 8393319 

DBSOURCE REFSEQ: accession 
NM 017002.1 
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As will be understood by those in the art, a method of 
identifying a ligand for a hydrophobic protein is synonymous 
with a method of screening for a ligand that binds a small 
molecule. Furthermore, as used herein, the term "screening" 
5 refers to a procedure used to detect the interaction between 
polypeptide, for example a hydrophobic protein, and a small 
molecule; it is useful for discriminating between ligands 
that bind to proteins with a Kd < 200|lM from large ensembles 
of ligands that either do not bind to the protein or bind 
1,10 only weakly with a Kd > 2 0 0}1M. 

G The present invention utilizes mass spectrometry {MS ) 

pi in the identification of hydrophobic protein ligands. The 
MS technique is only rarely performed to analyze samples 
| containing hydrophobic proteins because these samples 
,;U5 contain detergent amphiphiles. Detergents suppress analyte 
ion formation, a critical phenomenon for MS, and so 
m significantly hamper MS, that reports of successful MS 
H analysis of hydrophobic proteins are few. Nevertheless, 
several labs have tried to perform MS analysis of purified 
jh20 hydrophobic proteins by identifying methods to remove the 
detergent prior to MS analysis. All of these labs use 
matrix-assisted laser desorption ionization (MALDI) to 
present the protein sample to the detector. 

However, all of the known methods for sample 
25 preparation of hydrophobic proteins use organic solvents 
and/or acid to extract the detergent from the polypeptide 
prior to MS analysis. Such treatment denatures the 

polypeptide, a fact that precludes the binding of ligands to 
the analyte hydrophobic protein. For researchers interested 
30 in using MS for the study of hydrophobic protein- ligand 
interactions, denaturing preparation methods are not 
suitable. Moreover, the preparation of a membrane protein 
for analysis by MALDI -MS is laborious compared to the method 
provided by the present invention. 
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By contrast, in certain preferred embodiments the 
present invention uses electrospray ionization (ESI) MS 
which permits the fluid handling of a membrane protein 
sample as it passes from the SEC separation to the MS 
5 detector. As such, the MS analysis proceeds in less than 3 0 
seconds after the sample is sent to the RPC column. 

The reasons why detergents must be removed from protein 
samples prior to mass spectrometric analysis are well known, 
and are provided, e.g., in the following references: (a) 

WO BioTechniques (1997) 22:244-250; J. PC. Vissers, J.-P 

Chervet K. Sanborn, and J.-P Salzmann; (b) Protein Science 

fy (1994) 3:1975-1983; R.R. Ogorzalek Loo, N. Dales and P.C. 
Andrews; (c) J. Mass. Spectrom. (1995) 30:1462-8. Rosinke B, 

ip Strupat, K, et al . ; Methods Enzymol . (1996) 270:519-51; 

L ' 1: i5 Beavis, RC and Chait, BT; and Proc . Natl. Acad. Sci. USA 

U, (1990) 87:6873-7; Beavis, RC and Chait, BT. ; Fearnley, I.M. 

»' ? et a;., Biochem. Soc . Trans., (1996) 24:12-917. 

H 

In a first aspect, the invention provides a method for 
H identifying a ligand for a hydrophobic protein, the method 
20 comprising (a) selecting a ligand molecule by affinity 
selection by exposing a hydrophobic target protein bound by 
an amphiphile to a multiplicity of molecules to promote the 
formation of at least one complex between the hydrophobic 
target protein and the ligand molecule; (b) separating the 
25 complex from the unbound molecules; and (c) identifying the 
ligand molecule. 

The affinity screening methods of the invention are to 
be distinguished from functional selection methodologies. 
Functional selection methodologies involve ligand selection 
30 based on criteria that identify some specific protein-ligand 
or protein-protein interaction as significant; such ligand 
selection depends on either an action assignable to the 
protein (e.g., chemical catalysis for enzymes) or 
identification of some interaction between the protein and 
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some other molecule (e.g., interaction between a protein and 
a known small molecule ligand in the case of non- enzymes) . 
In summary, these screening methodologies are generally 
based on enzymatic or biof unctional assays or ligand 
5 displacement assays. 

Various embodiments of the method of the invention may 
utilize an automated ligand identification system (ALIS) for 
the discovery of novel drug leads. ALIS selects ligands 
based on the affinity of the compound for its target protein 
10 and identifies the ligands by mass spectrometry (see 
International Patent Application WO 99/35109) . The 
invention herein provides for the application of ALIS to 
q amphiphile complexed HPs . 

f II As used herein, the term "affinity selection" means a 

jp|5 ligand selection based on the affinity of one molecule for a 
;:- selected protein target; such ligand selection is 
^ independent of the functional activity of the protein of 
L interest other than for the fact the protein binds the small 
j|W molecule. 

20 As used herein, the term "amphiphile" is used to mean 

O any molecule generally with the properties of a detergent, 
!?! " phospholipid, or surfactant that enhances the water 
solubility of hydrophobic polypeptides; specifically any 
molecule known to assume an association colloid in aqueous 

25 solution; non- limiting examples of such amphiphiles would 
include phospholipids and other polar lipids (exemplified by 
phosphatidylcholines, lysophospholipids , cholesterols , 
lecithins, ceramides, etc) ; amphiphilic macromolecular 
polymers (exemplified by the work of Christophe Tribet and 

30 Jean- Luc Popot (Tribet, C. et al . J.L. Natl. Acad. Sci . USA 
(1996) 93:15047-50); surfactants including alkyl 

saccharides, alkyl thioglycosides , alkyl dimethylamine 
oxides, bile acid derivatives like cholate and the CHAPS 
series, FOS-CHOLINE™ series, CYMAL™ or CYGLU™ series, 

35 glucamides, and alkyl polyoxyethylenes , etc; or polypeptides 
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known to adopt amphipathic structures (exemplified by work 
of C.E. Schafmeister and R.M. Stroud (Schaf meister , C.E. et 
al., RM Science (1993) 262:734-8). 

As used herein, the term "multiplicity of molecules" 
5 refers to a plurality of molecules to be tested for the 
property of specific binding to a hydrophobic target 
protein. By the term "molecule" is meant, any compound in 
the size range of 150 to 5000 atomic mass units (amu) . Such 
compounds may be generated by any means known in the art . 
10 Particularly favored methods of generating the multiplicity 
„t of molecules is through the use of combinatorial chemistry. 
::f A "combinatorial library" refers to a plurality of molecules 
|i or compounds which are formed by combining, in every 
i=l possible way for a given compound length, a set of chemical 
»J5 or biochemical building blocks which may or may not be 
0 related in structure. Alternatively, the term can refer to 
. a plurality of chemical or biochemical compounds which are 
1| formed by selectively combining a particular set of chemical 
building blocks. For example, twenty amino acids randomly 
JO combined into hexameric peptides will produce no less than 
$ 64 million compounds. With the "combinatorial library" 
approach, as many different compounds as possible are made, 
and then candidate compounds are selected by screening them 
for binding activity against the target molecule of 
25 interest, e.g., a hydrophobic protein. 

There are now many methods well known in the art for 
the construction of combinatorial libraries. By way of non- 
limiting example, the following references provide methods 
for combinatorial library construction: U.S. Patent nos . 
30 6, 147,344; (WO 99/35109; 6, 114,309; 6, 025, 371; 6,017,768; 
5,962,337; 5,919,955; and 5,856,496, to name a few. As used 
herein, the term "multiplicity of molecules" may also refer 
to a natural plurality of molecules or compounds, obtained 
for example from body fluids, tissues or cells. These 
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samples may be manipulated, e.g., proteolytically digested, 
in vitro prior to their use in a ligand screening protocol. 

In certain embodiments of the first aspect, exposure of 
the hydrophobic target protein to a multiplicity of 
5 molecules occurs under homogeneous solution phase 
conditions. In certain embodiments of the first aspect, 
exposure of the hydrophobic target protein to a multiplicity 
of molecules occurs under heterogeneous solution phase 
conditions. In certain embodiments of the first aspect, 
10 selection of the ligand molecule is done using multi- 
| oj . dimensional chromatography. 

O As used herein, the term "homogeneous solution phase" 

pj means a protein preparations whereby a protein is combined 
a;j with (a) ligand (s) with the intent to facilitate possible 
1 15 protein- ligand interactions; such preparations are found as 
sjfj sols in the temperature range of -4 0 to 6 0 °C such that 
5 neither protein nor ligand are bound to a supporting 

« t element; these preparations would either pass through a 
hi semipermeable membrane with a size cut-off of 5.0 uM or 
j20 behave as though they a sedimentation coefficient of less 
than 500 Svedbergs or both; examples of such preparations 
would include combinations of ligand (s) with proteins that 
are solubilized in amphiphile, proteins incorporated into 
proteoliposomes, proteins incorporated into cell-derived 
25 virus-like particles, etc. 

As used herein, the term "heterogeneous solution phase" 
means a protein preparations whereby a protein is combined 
with (a) ligand (s) with the intent to facilitate possible 
protein- ligand interactions; such preparations are found as 
30 mixtures in the temperature range of -40 to 60°C such that 
either the protein or the ligand is bound to a supporting 
element; these preparations would either fail to pass 
through a semipermeable membrane with a size cut-off of 5.0 
uM or they would behave as though they have a sedimentation 
35 coefficient of greater than 500 Svedbergs or both; examples 
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of such preparations would include combinations of ligand(s) 
with proteins that are presented on the surface of a 
bead/stationary element whereby the protein is attached to 
the bead/stationary element through either a covalent or 
5 non- covalent linkage or a linkage dependent upon the self- 
association of amphiphiles or combinations of proteins with 
ligands that are fixed to a stationary phase. 

As used herein, the term "multi -dimensional 
chromatography" means a procedure for processing a sample 
10 involving more than one chromatographic method in tandem. 
Representative types of chromatographic methods include: (1) 
III solid phase chromatography media: any of a variety of 

f|| materials including small particles (<5 uM) , solid porous 
■jl castings, filters, or semipermeable membranes that may 
□15 commonly be referred to as resins, gels, immobilized 
'41 artificial membranes, stationary phase elements, or 
otherwise, and are used with the intent of providing 
ill stationary surfaces over which or through which solubilized 
analytes are passed or with which solubilized analytes 
•~20 interact as in chromatographic or electrophoretic 
H separations or fractionations; and (2) solution phase 
chromatography media: solutions or fluids suitable for use 
in electrophoretic separations or fractionations when used 
in combination with stationary phase chromatography media. 
25 In certain embodiments of the first aspect, the 

hydrophobic target protein is selected from the group 
consisting of a membrane protein, an integral membrane 
protein, a transmembrane protein, a monotopic membrane 
protein, a polytopic membrane protein, a pump protein, a 
30 channel protein, a receptor kinase protein, a G protein- 
coupled receptor protein, a membrane-associated enzyme, and 
a transporter protein. 

In certain embodiments of the first aspect, the 
multiplicity of molecules is a mass coded library of 
35 molecules. In certain embodiments of the first aspect, the 
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multiplicity of molecules is a library of molecules that is 
not mass coded. 

As used herein, the term "mass-coded library" refers to 
a mass coded combinatorial library. The compounds of the 
5 mass -coded combinatorial library are of the general formula 
X(Y) n , wherein X is a scaffold, each Y is a peripheral 
moiety and n is an integer greater than 1, typically from 2 
to about 6. The term "scaffold", as used herein, refers to 
a molecular fragment to which two or more peripheral 
10 moieties are attached via a covalent bond. The scaffold is 
a molecular fragment which is common to each member of the 
!p mass-coded set of compounds. The term "peripheral moiety", 
FU as used herein, refers to a molecular fragment which is 
bonded to a scaffold. Each member of the set of mass -coded 
C15 compounds will include a combination of n peripheral 
% " moieties bonded to the scaffold and this set of compounds 
JUb forms a mass -coded combinatorial library. More details of 
'sif mass-coded libraries are provided in the patent application 
%-| WO9935109A1, which is incorporated herein by reference. 
120 As used herein, the phrase "a library of molecules that 

- is not mass-coded" means any plurality of molecules or 
compounds that are not produced by a mass -coded 
combinatorial process. Thus, the term includes any and all 
other methods of producing a combinatorial library. In 
25 addition, the term also includes compounds constructed by 
"Structure Based Drug Design" methodology, which seeks to 
design a drug based on the structure of the target protein, 
and natural libraries of compounds obtained from body 
fluids, tissues or cells. 
30 In certain embodiments of the first aspect, the 

amphiphile is selected from the group consisting of (a) a 
polar lipid, (b) an amphiphilic macromolecular polymer, (c) 
a surfactant or detergent, and (d) an amphiphilic 
polypeptide. In certain embodiments of the first aspect, 
35 ligand identification is done by mass spectral analysis. In 
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certain embodiments of the first aspect, the ligand molecule 
is deconvoluted by mass spectral analysis. In certain 
embodiments of the first aspect, separation of the complex 
from the unbound molecules is accomplished with solid phase 
5 chromatography media. 

By the term "ligand identification" as used herein is 
meant any process that can accurately specify the structural 
composition of a small molecule detected in a screen. 
In certain embodiments of the first aspect, the 
10 hydrophobic target protein comprises (a) at least one 
^ transmembrane domain sequence, (b) at least two tag 
fi\ sequences useful for affinity selection, and (c) a 
III hydrophobic protein (HP) sequence. In certain embodiments 
pi thereof, the hydrophobic protein sequence is selected from 
•£115 the group consisting of (a) a membrane protein, (b) an 
^ integral membrane protein, (c) a transmembrane protein, (d) 
jj»ti a monotopic membrane protein, (e) a polytopic membrane 
Hi protein, (f) a pump protein, (g) a channel protein, (h) a 
;*p receptor kinase protein, (i) a G protein-coupled receptor 
1*120 protein, (j) a membrane-associated enzyme, and (k) a 

transporter protein. In certain embodiments thereof, the 
tag sequences comprise epitope tag sequences selected from 
the group consisting of (a) a FLAG tag (NH2 - DYKDDDDK- COOH ) 
(SEQ ID NO:l), (b) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID 
25 NO:2), (c) a hemagglutinin tag (NH2 -YPYDVPDYA-COOH) (SEQ ID 
NO:3), (d) a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID NO : 4 ) , 
(e) an HSV tag ( NH2 - QPELAPEDPED - COOH ) (SEQ ID NO : 5 ) and (f) 
a rhodopsin tag (NH2GTEGPNFYVPFSNKTGWRSPFEAPQYYLAEPWQFSM- 
COOH) (SEQ ID NO: 6) . 
30 In certain embodiments of the first aspect, the 

hydrophobic target protein comprises a sequence with an 
amino terminus to carboxy terminus order selected from the 
group consisting of (a) Tagl-Tag2 -HP, (b) Tagl-HP-Tag2 , and 
(c) HP-Tagl-Tag2 . In certain embodiments of thereof, the 
35 invention provides a method wherein the hydrophobic target 
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protein is selected from the group consisting of (a) Myc 
tag-EE tag-Human m2 mAChR (SEQ ID N0:6), (b) Flag tag-Human 
Beta 2 Adrenergic Receptor-EE tag (SEQ ID N0:7), (c) Human 
Neurokinin 3 Receptor-HSV tag-Myc tag (SEQ ID NO:9), (d) 
5 Flag tag-Human ml mAChR- EE tag (SEQ ID NO:10), and (e) Rat 
m3 mAChR-HSV tag-OctaHis tag (SEQ ID NO:ll). In certain 
embodiments thereof, the invention provides a method wherein 
the hydrophobic target protein further comprises a 
heterologous signal sequence (SS) at the amino terminus. In 
10 certain embodiments thereof, the heterologous signal 
M sequence is selected from the group consisting of (a) the 
|| Mellitin signal sequence of NH 2 - KFLVNVALVFMWYISYI YA - COOH 
m (SEQ ID NO:12), (b) the GP signal sequence of NH 2 - 

;| VRTAVL I LLLVRFSEP - COOH (SEQ ID NO : 13 ) , (c) the Hemagglutinin 
f=|5 signal sequence of NH 2 -KTIIALSYIFCLVFA-COOH (SEQ ID N0:14), 
ill (d) the rhodopsin tag 1 signal sequence of NH 2 - 
' MNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEP-COOH (SEQ ID NO: 15), and 

ftj (e) the rhodopsin tag ID4 signal sequence of NH 2 - 
^ GKNPLGVRKTETSQVAPA- COOH (SEQ ID NO: 16). In certain 

20 embodiments thereof the tag sequences further comprise a 
H hexahistidine sequence (SEQ ID NO: 17) and a decahist idine 
sequence (SEQ ID NO: 18). In yet certain embodiments thereof 
the hydrophobic target protein is selected from the group 
consisting of (a) GP67 SS-Myc tag-EE tag-Human m2 mAChR (SEQ 
25 ID N0:19), (b) Mellitin SS-Flag tag-Human Beta 2 Adrenergic 
Receptor-EE tag(SEQ ID NO:20), (c) Hemagglutinin SS-Human 
Neurokinin 3 Receptor-HSV tag-Myc tag (SEQ ID NO:21), (d) 
Mellitin SS-Flag tag-Human ml mAChR-EE tag (SEQ ID NO:22), 
and (e) Hemagglutinin SS-Rat m3 mAChR-HSV tag-OctaHis tag 
30 (SEQ ID NO: 23) . 

In a second aspect, the invention provides a method of 
isolating a hydrophobic protein, the method comprising (a) 
purifying the hydrophobic protein by sucrose gradient 
ultracentrifugation, (b) purifying the hydrophobic protein 
35 by antibody affinity purification, and (c) purifying the 
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hydrophobic protein by immobilized metal affinity 
chromatography . 

In certain embodiments of the second aspect, the 
hydrophobic protein comprises (a) at least one transmembrane 
5 domain sequence, {b) at least two tag sequences useful for 
affinity selection, and (c) a hydrophobic protein (HP) 
sequence. In certain embodiments thereof, the hydrophobic 
protein sequence is selected from the group consisting of 
(a) a membrane protein, (b) an integral membrane protein, 
10 (c) a transmembrane protein, (d) a monotopic membrane 
^* protein, (e) a polytopic membrane protein, (f) a pump 
p protein, (g) a channel protein, (h) a receptor kinase 
W protein, (i) a G protein-coupled receptor protein, (j) a 
f-i membrane-associated enzyme, and (k) a transporter protein. 
Q5 In certain embodiments of the second aspect, the tag 
™ ! sequences of the hydrophobic protein comprise epitope tag 
§>«> sequences selected from the group consisting of (a) a FLAG 
tag (NH2 -DYKDDDDK- COOH) (SEQ ID NO:l), (b) an EE tag (NH2- 
Jll EEEEYMPME - COOH ) (SEQ ID NO : 2 ) , (c) a hemagglutinin tag (NH2- 
-20 YPYDVPDYA-COOH) (SEQ ID NO : 3 ) , (d) a myc tag (NH2- 
r *- KHKLEQLRNSGA-COOH) (SEQ ID NO:4), (e) an HSV tag (NH2- 
QPELAPEDPED - COOH ) (SEQ ID NO: 5) and (f) a rhodopsin tag 
(NH2 MNGTEGPNFYVPFSNKTGWRS PFEAPQ YYLAE PWQFSM - COOH ) (SEQ ID 
NO: 6) . In certain embodiments of the second aspect, the 
25 hydrophobic protein comprises a sequence with an amino 
terminus to carboxy terminus order selected from the group 
consisting of (a) Tagl -Tag2 -HP , (b) Tagl-HP-Tag2 , and (c) 
HP-Tagl-Tag2 . 

In certain embodiments of the second aspect, the 
30 hydrophobic protein is selected from the group consisting of 
(a) Myc tag-EE tag-Human m2 mAChR (SEQ ID NO:7), (b) Flag 
tag-Human Beta 2 Adrenergic Receptor-EE tag (SEQ ID NO:8), 
(c) Human Neurokinin 3 Receptor-HSV tag-Myc tag (SEQ ID 
NO:9), (d) Flag tag-Human ml mAChR-EE tag (SEQ ID NO:10), 
35 and (e) Rat m3 mAChR -HSV tag-OctaHis tag (SEQ ID N0:11). In 
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embodiments thereof, the hydrophobic protein further 
comprises a heterologous signal sequence (SS) at the amino 
terminus. In certain embodiments thereof, the heterologous 
signal sequence is selected from the group consisting of (a) 
the Mellitin signal sequence of NH2 -KFLVWALVFMWYISYIYA- 
COOH (SEQ ID NO: 12), (b) the GP signal sequence of NH2- 
VRTAVLILLLVRFSEP- COOH (SEQ ID NO:13), (c) the Hemagglutinin 
signal sequence of NH2 - KTI IALS YIFCLVFA- COOH (SEQ ID NO: 14), 

(d) the rhodopsin tag 1 signal sequence of NH2- 

MNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEP - COOH (SEQ ID NO: 15), and 

(e) the rhodopsin tag ID4 signal sequence of NH2- 
GKNPLGVRKTETSQVAPA- COOH (SEQ ID NO: 16). In certain 
embodiments of the second aspect, the tag sequences of the 
hydrophobic protein further comprise a hexahistidine 
sequence (SEQ ID NO: 17) and a decahistidine sequence (SEQ ID 
NO: 18) . 

In certain embodiments of the second aspect, the 
hydrophobic target protein is selected from the group 
consisting of (a) GP67 SS-Myc tag-EE tag-Human m2 mAChR (SEQ 
ID NO:19), (b) Mellitin SS-Flag tag-Human Beta 2 Adrenergic 
Receptor-EE tag(SEQ ID NO:20), (c) Hemagglutinin SS-Human 
Neurokinin 3 Receptor-HSV tag-Myc tag (SEQ ID NO: 21), (d) 
Mellitin SS-Flag tag-Human ml mAChR- EE tag (SEQ ID NO:22), 
and (e) Hemagglutinin SS-Rat m3 mAChR-HSV tag-OctaHis tag 
(SEQ ID NO: 23) . 

In a third aspect, the invention provides an isolated 
nucleic acid molecule suitable for hydrophobic protein 
expression, comprising (a) a vector polynucleotide sequence 
for protein expression in a eukaryotic cell, and (b) a 
polynucleotide sequence encoding an engineered hydrophobic 
protein comprising the following elements (i) an N- terminal 
methionine residue, (ii) a heterologous signal sequence 
(SS) , (iii) at least one transmembrane domain sequence, (iv) 
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at least two tag sequences useful for affinity purification, 
and (v) a hydrophobic protein (HP) sequence. 

By the phrase "a vector polynucleotide sequence for 
protein expression in a eukaryotic cell" is meant any 
5 polynucleotide sequence comprising an origin of replication 
allowing replication in a eukaryotic cell, a selectable 
marker, e.g., antibiotic resistance marker, and a promoter 
sequence element to promote transcription of the structural 
gene, which may be viral, prokaryotic or eukaryotic in 
10 origin. The origin of the vector polynucleotide sequence 
i*l may be viral, prokaryotic or eukaryotic or a combination 
O thereof. As will be understood in the art, the vector 
; sequence while being designed for expression in a eukaryotic 
Q cell may optionally contain a prokaryotic origin of 
M.5 replication. Non- limiting examples of suitable vector 

polynucleotide sequences include the following: baculovirus 
M vectors such as pVL13 92 (Pharmingen, San Diego, CA) and 
r; pBAC-l (Novagen, Madison, WI) and mammalian expression 
m vectors such as pcDNA 3.1 (Invitrogen, San Diego, CA) and 
pTriEx-1 (Novagen, Madison, WI ) . 

The appropriate DNA sequence may be inserted into the 
vector by a variety of procedures . In general , the DNA 
sequence is inserted into an appropriate restriction 
endonuclease site(s) by procedures known in the art. Such 
25 procedures and others are deemed to be within the scope of 
those skilled in the art. 

In certain embodiments, the present invention relates 
to host cells containing the above -described constructs. The 
host cell can be a higher eukaryotic cell, such as a 
30 mammalian cell, or a lower eukaryotic cell, such as a yeast 
cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the construct into the host 
cell can be effected by any means known in the art, 
including but not limited to transduction or transformation 
35 or transfection or electroporat ion . 
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Examples of a suitable heterologous signal sequences 
(SS) include the honey bee mellitin SS (NH2- 
MKFLVNVALVFMWY I S Y I YA - COOH ) (SEQ ID NO: 12) (Tessier D.C., 
(1991) Gene 98: 177), see also Invitrogen . com' s pMelBac 
product) , the baculovirus gp67 SS (NH2 -MVRTAVLILLLVRFSEP- 
COOH) (SEQ ID NO: 13) (Kretzschmar T. , et al . , (1996) J". 
Immunol Methods 195: 93-101), the Influenza A virus 
hemagglutinin SS (NH2 -MKTI IALSYIFCLVFA-COOH) (SEQ ID NO : 14 ) 
(Verhoeyen, M. , (1980) Nature 286: 771-776), the rhodopsin 
tag 1 SS ( NH2 - MNGTEGPNFYVPFSNKTGWRS PFEAPQYYLAEP - COOH ) (SEQ 
ID NO:15), or the rhodopsin tag ID4 SS (NH2- 
GKNPLGVRKTETSQVAPA-COOH) (SEQ ID NO: 16). Generally, such 
suitable SSs will be less that 75 aa long. These signal 
sequences may or may not be cleaved from the protein upon 
expression in animal cells. For example, the rhodopsin tag 
1 is not cleaved off of the protein when presented by the 
cell to the plasma membrane, whereas rhodopsin tag ID4 
signal sequence is cleaved off. 

Non-limiting examples of a suitable epitope affinity 
tags include FLAG (NH2 -DYKDDDDK-COOH) (SEQ ID NO:29), "EE" 
(NH2-EEEEYMPME-COOH) (SEQ ID NO : 3 0 ) , hemagglutinin (NH2 - 
YPYDVPDYA-COOH) (SEQ ID NO: 31), myc (NH2 -KHKLEQLRNSGA- COOH) 
(SEQ ID NO:32), or herpes simplex virus tag ("HSV"; NH2- 
QPELAPEDPED-COOH) (SEQ ID NO: 33) . 

These design elements of the hydrophobic protein are 
arrayed from amino to carboxy terminus in one of the 
following permutations: (1) SS-Tagl-Tag2 -HP; (2) SS-Tagl-HP- 
Tag2; (3) SS-HP-Tagl-Tag2 . 

In certain embodiments thereof, the N- terminal 
methionine sequence and the heterologous signal sequence are 
selected from the group consisting of (a) 
MKFLVNVALVFMWYI SYI YA (SEQ ID NO:24), (b) MVRTAVLILLLVRFSEP 
(SEQ ID N0:25), (c) MKTI IALSYI FCLVFA (SEQ ID NO:26) (d) 
MMNGTEGPNFYVPFSNKTGWRS PFEAPQYYLAEP -COOH (SEQ ID NO: 27) and 
(e) MGKNPLGVRKTETSQVAPA- COOH (SEQ ID NO: 28) . 

35 
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In certain embodiments thereof, the tag sequences 
comprise epitope tag sequences selected from the group 
consisting of (a) a FLAG tag ( NH2 - D YKDDDDK- COOH ) (SEQ ID 
NO:l), (b) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID NO : 2 ) , (c) 
5 a hemagglutinin tag ( NH2 - YPYDVPDYA- COOH ) (SEQ ID NO:3), (d) 
a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID NO:4), (e) an HSV 
tag (NH2 -QPELAPEDPED-COOH) (SEQ ID NO : 5 ) , and (f) a 
rhodopsin tag (NH2 MNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEPWQFSM- 
COOH) (SEQ ID NO: 6) . 
10 In certain embodiments of the third aspect, the 

U elements of the engineered hydrophobic protein are arrayed 
from an amino to carboxy terminus order selected from the 
ftf group consisting of (a) SS-Tagl-Tag2-HP, (b) SS-Tagl-HP- 
f- Tag2, and (c) SS-HP-Tagl-Tag2 . In embodiments thereof, the 
r«15 engineered hydrophobic protein is selected from the group 
$ consisting of (a) GP6 7 -Myc -EE -Human m2 mAChR (SEQ ID 
J\ NO:19), (b) Mellitin-Flag Tag-Human Beta 2 Adrenergic 
fIJ Receptor-EE (SEQ ID NO:20), and (c) Hemagglutinin SS-Human 
^ Neurokinin 3 Receptor-HSV-Myc (SEQ ID NO: 21) . In a further 
tQO embodiment of the third aspect, the tag sequences further 
H comprise a hexahist idine sequence (SEQ ID NO: 17) and a 
decahistidine sequence (SEQ ID NO: 18). In certain 

embodiments of the third aspect, the engineered hydrophobic 
protein is selected from the group consisting of (a) GP67- 
25 Myc -EE -Human m2 mAChR (SEQ ID NO:19), (b) Mellitin-Flag Tag- 
Human ml mAChR- EE (SEQ ID NO:20) , and (c) Hemagglutinin SS- 
Rat m3 mAChR-HSV-OctaHis (SEQ ID NO: 21) . 

The HP sequence of the isolated polynucleotide may be 
any polynucleotide encoding a protein that is isolated with 
30 amphiphile present. Alternatively, the term may also refer 
to any polynucleotide encoding a protein for which, when 
purified to greater than 1% purity or purified to greater 
than 10% purity or purified to greater than 25% purity or 
purified to greater than 50-99% purity, either requires or 
35 benefits from the presence of an amphiphile for functional 
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assays, to enhance stability (shelf -life or ability to 
withstand freeze- thaw cycles) , or to retain conformational 
integrity as observed by common laboratory techniques 
including ligand binding assays, circular dichroism, 
hydrodynamic assessments of mean size, shape, or density, 
interaction with conformation- specif ic antibodies. In a 
preferred embodiment the hydrophobic protein of the 
invention is a mammalian hydrophobic protein. In a 

particularly preferred embodiment, the hydrophobic protein 
of the invention is a human hydrophobic protein. 

The HP sequence of the isolated polynucleotide may 
include polynucleotides encoding proteins that are 
identified by bioinf ormatics-assisted means through the use 
of the following non-limiting examples of algorithms 
designed for the identification of hydrophobic proteins: (a) 
DAS - Prediction of transmembrane regions in prokaryotes 
using the Dense Alignment Surface method (Stockholm 
University) Cserzo, M. et al . (1997) Prot. Eng. 10:673- 
676; (b) HMMTOP - Prediction of transmembrane helices and 
topology of proteins (Hungarian Academy of Sciences) G.E 
Tusnady and I. Simon (1998) J. Mol . Biol. 283: 489-506; (c) 
Hidden Markov Model Predictions Sonnhammer, E.L.L. et al . 
(1998) A hidden Markov model for predicting transmembrane 
helices in protein sequences. Proc . of the Sixth Intern. 
Conf. on Intelligent Systems for Molecular Biology (ISMB98) , 
175-182; (D) TMAP - Transmembrane detection based on 
multiple sequence alignment (Karolinska Institut; Sweden) No 
reference available: see URL at http//www . mbb . ki . se/tmap/ ; 
and (e) TopPred 2 - Topology prediction of membrane proteins 
(Stockholm University), von Heijne, G. (1992) J. Mol. Biol. 
225:487-494 and Cserzo, M. et al . (1997) Prot. Eng. 10:673- 
676. 

Representative non-limiting examples of proteins that 
may be encoded by the HP polynucleotide of the invention are 
presented herein in Table 1. 
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All nucleic acid sequences and the respective amino 
acid sequences encoded thereby identified above by the 
appropriate GenBank accession numbers are herein 
incorporated by reference. 

In a fourth aspect, the invention provides a method for 
identifying a ligand for a hydrophobic protein, the method 
comprising (a) selecting a hydrophobic target protein from 
the group consisting of (i) a membrane protein, (ii) an 
integral membrane protein, (iii) a transmembrane protein, 
(iv) a monotopic membrane protein, (v) a polytopic membrane 
protein, (vii) a pump protein, (viii) a channel protein, 
(iX)a receptor kinase protein, (X) a G protein-coupled 
receptor protein, Xii) a membrane-associated enzyme, and 
(Xiii) a transporter protein, wherein the hydrophobic 
protein is bound by amphiphile selected from the group 
consisting of (i) a polar lipid, (ii) an amphiphilic 
macromolecular polymer, (iii) a surfactant or detergent, and 
(iV) an amphiphilic polypeptide; (b) selecting a ligand 
molecule using multi-dimensional chromatography by affinity 
selection by exposing under homogenous or heterogeneous 
solution phase conditions the hydrophobic target protein 
bound by an amphiphile to a multiplicity of molecules from a 
mass-coded library to promote the formation of at least one 
complex between the hydrophobic target protein and the 
ligand molecule, (c) separating the complex from the unbound 
molecules, and (d) identifying the ligand molecule by mass 
spectral analysis. 

In a fifth aspect, the invention provides a method for 
identifying a ligand for a hydrophobic protein, the method 
comprising (a) selecting a hydrophobic target protein from 
the group consisting of (i) a membrane protein, (ii) an 
integral membrane protein, (iii) a transmembrane protein, 
(iv) a monotopic membrane protein, (v) a polytopic membrane 
protein, (vii) a pump protein, (viii) a channel protein, 
(iX)a receptor kinase protein, (X) a G protein-coupled 
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receptor protein, Xii) a membrane-associated enzyme, and 
(Xiii) a transporter protein, wherein the hydrophobic 
protein is bound by amphiphile selected from the group 
consisting of (i) a polar lipid, (ii) an amphiphilic 
macromolecular polymer, (iii) a surfactant or detergent, and 
(iV) an amphiphilic polypeptide; (b) selecting a ligand 
molecule using multi -dimensional chromatography by affinity 
selection by exposing under homogenous or heterogeneous 
solution phase conditions the hydrophobic target protein 
bound by an amphiphile to a multiplicity of molecules from a 
library that is not mass -coded to promote the formation of 
at least one complex between the hydrophobic target protein 
and the ligand molecule, (c) separating the complex from the 
unbound molecules, and (d) identifying the ligand molecule 
by mass spectral analysis. 

In a sixth aspect, the invention provides a method of 
isolating a hydrophobic protein, the method comprising: (a) 
selecting a hydrophobic protein comprising: (i) at least one 
transmembrane domain sequence, (ii) at least two tag 
sequences useful for affinity selection selected from the 
group consisting of: (A) a FLAG tag (NH2 -DYKDDDDK-COOH) (SEQ 
ID NO: 29) , (B) an EE tag (NH2 - EEEEYMPME - COOH ) (SEQ ID 

NO:30), (C) a hemagglutinin tag (NH2 -YPYDVPDYA-COOH) (SEQ ID 
NO:31), (D) a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID 
NO:32), and (E) an HSV tag (NH2 -QPELAPEDPED-COOH) (SEQ ID 
NO:33); (iii) a hydrophobic protein (HP) sequence selected 
from the group consisting of: (A) a membrane protein, (B) an 
integral membrane protein, (C) a transmembrane protein, (D) 
a monotopic membrane protein, (E) a polytopic membrane 
protein, (F) a pump protein, (G) a channel protein, (H) a 
receptor kinase protein, (I) a G protein-coupled receptor 
protein, (J) a membrane-associated enzyme, and (K) a 
transporter protein; (b) purifying the hydrophobic protein 
by sucrose gradient ultracentrifugation; (c) purifying the 
hydrophobic protein by antibody affinity purification; and 
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(d) purifying the hydrophobic protein by immobilized metal 
affinity chromatography. 

In a seventh aspect, the invention provides, an 
isolated nucleic acid molecule suitable for hydrophobic 
protein expression, comprising: (a) a vector polynucleotide 
sequence for protein expression in a eukaryotic cell, and 
(b) a polynucleotide sequence encoding an engineered 
hydrophobic protein comprising the following elements (i) an 
N-terminal methionine residue, (ii) a heterologous signal 
sequence (SS) , wherein the N-terminal methionine sequence 
and the heterologous signal sequence are selected from the 
group consisting of (1) MKFLVNVALVFMWYISYIYA (SEQ ID 
NO:24), (2) MVRTAVLILLLVRFSEP (SEQ ID NO:25), (3) 
MKTIIALSYIFCLVFA (SEQ ID NO:26), (4) 

MMNGTEGPNFYVPFSNKTGWRSPFEAPQYYLAEP - COOH (SEQ ID NO: 27) and 
(5) MGKNPLGVRKTETSQVAPA- COOH (SEQ ID NO:28); (iii) at least 
one transmembrane domain sequence, (iv) at least two tag 
sequences useful for affinity selection selected from the 
group consisting of (1) a FLAG tag ( NH 2 - D YKDDDDK - COOH ) (SEQ 
ID NO:l), (2) an EE tag (NH2 -EEEEYMPME-COOH) (SEQ ID N0:2), 

(3) a hemagglutinin tag ( NH 2 - Y P YDVPD YA - COOH ) (SEQ ID NO:3), 

(4) a myc tag (NH2 -KHKLEQLRNSGA-COOH) (SEQ ID NO : 4 ) , and (5) 
an HSV tag (NH2 -QPELAPEDPED-COOH) (SEQ ID NO:5), and (v) a 
hydrophobic protein (HP) sequence selected from the group 
consisting of (1) a membrane protein, (2) an integral 
membrane protein, (3) a transmembrane protein, (4) a 
monotopic membrane protein, (5) a polytopic membrane 
protein, (6) a pump protein, (7) a channel protein, (8) a 
receptor kinase protein, (9) a G protein-coupled receptor 
protein, (10) a membrane -associated enzyme, and (11) a 
transporter protein. 

The following examples are intended to further 
illustrate certain preferred preferred embodiments of the 
invention but are not meant to limit the scope of the 
invention in any way. 
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EXAMPLES 

EXAMPLE Is AFFINITY SELECTION OF COX-1 LIGANDS AND 

IDENTIFICATION BY ALIS 

Purified ovine COX-1 (>95% by SDS-PAGE) from Cayman 
Chemical Company (Ann Arbor, MI) was prepared for screening 
by exchanging the detergent . To remove the detergent in 
which the protein was supplied, Tween-20, ion exchange 
chromatography was conducted. Approximately 6 raL of 0.27 
mg/mL COX-1 in a buffer of 80 mM Tris-8.0, 0.09% Tween-20, 
270 UM DDC, and 240 fiM dodecyl-p-D-maltoside (Dp"M) , which 
would provide a theoretical yield of 1.8 mg, was applied to 
a Poros HQ column with a buffer of 8 0 mM tris, pH 8.0, 240 
MM DfiM (TBS-AGD) . The column was developed with a linear 
gradient from 0 to 0.5 M NaCl over 10 minutes with a flow 
rate of 5 mL/minute. The eluted protein fractions were 
identified by monitoring absorbance at 280 nm. 

After this treatment, 18 mL of protein-containing 
material were pooled and concentrated in an Amicon-30 
centricon according to the manufacturer's instructions 
(Millipore, Inc.; Bedford, MA). This yielded a 

concentration of approximately 1.8 mg/ml of COX-1 which was 
diluted to 1.3 mg/mL (20 |1M COX-1) for screening. It is 
estimated that the buffer in the final protein preparation 
consisted of 2.4 mM D(3m, 80 mM tris-8.0, about 50 mM NaCl . 
This COX-1 solution was promptly supplemented with 2 0 |LlM 
hemin, and 3 00 |1M diethyldithiocarbamate (DDC) in accordance 
with the handling procedures used by Cayman Chemical 
company . 

The COX-1 protein preparation was then used for sample 
preparation according to the appended sample preparation 
standard operating procedure (SOP, Table 2) . 
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This sample prep SOP yields binding assays that combine COX- 
1 with mass -encoded combinatorial libraries made of 
approximately 2500 small drug-like molecules, each member at 
5 a concentration of 1 |LlM. The sample was incubated 30- 

minutes at 4°C. As the total volume was 12 flL, the sample 
thus contained 24 0 pmol protein and 12 pmol of each library 
component. As a control test, an unrelated membrane 
protein, diacyl glycerol kinase (Calbiochem, Inc.; San 
10 Diego, CA) , was also prepared in the same buffer at 2 0 JIM 

and incubated with the same 2500-member library and treated 
similarly. 

Then the mixtures were individually subjected to ALIS 
Analysis. If any ligand of suitably high affinity was bound 

15 to the COX at the time its fraction was collected, the mass 
spectral analysis would identify its mass. By virtue of the 
mass-coding, the precise combination of building blocks and 
core molecule can be identified (see U.S. Patent No. 
6,147,344). If the same compound failed to appear in the 

20 diacyl glycerol kinase control experiment, the compound may 
then be identified as a specific ligand of the COX-1. 

The mixtures were then individually subjected to 
modified ALIS analysis as follows. The large detergent- 
solubilized protein was separated from the small drug- like 

25 molecules by size exclusion chromatography (SEC) over a 
4.6mm x 50 mm x 5 Jim SEC column at 0°C using a running 
buffer of TBS (80 mM tris, pH 8.0, 150 mM NaCl, 2.5% DMSO) 
at a flowrate of 2 mL/minute. The eluting SEC fraction- 
containing protein was identified by UV-VIS detection 

30 monitoring at 23 0 nm and transferred by way of a sample loop 
to a low- flow (100 jlL/minute) reverse -phase chromatography 
(RPC) system. The RPC column (Higgins C-18; 1 mm x 50 mm x 
5 Jim) is maintained at 60°C to promote dissociation of 
ligands from the complex. From this RPC column, the ligand 
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is eluted into a high- resolution mass spectrometer for 
analysis using a gradient of 5%-95% acetonitrile (0.1% 
formic acid counterion) in water (w/ 0.1% formic acid) over 
5 minute. If any ligand of suitably high affinity was bound 
5 to the COX at the time its fraction was collected, the mass 
spectral analysis will identify its mass. By virtue of the 
mass encoding the precise combination of building blocks and 
core can be identified (see U.S. Patent No. 6, 147,344 by 
Annis et al.) . If the same compound failed to appear in the 
10 diacyl glycerol kinase control experiment, the compound 
would be identified as a specific ligand of the COX-1. 

Pi Figure 6 illustrates the separation of protein from unbound 

ill small molecules using ALIS. 

%l Control experiments demonstrated that COX-1 screened in 

pl5 this manner enabled known COX-1 ligands to be extracted from 
large mixtures of small molecules (Figure 7) . When a test 
M library composed of 25 UM meclof enamate , 25 |IM indomethacin, 
ul 1 \lM each of various test libraries, these known COX-1 
•ft ligands are recovered and identified by the ALIS screening 
:§0 method . 

This experiment demonstrated that after screening over 
330,000 small drug-like molecules, 41 small molecules were 
identified as COX-1 ligands, one example of which is shown 
in Figure 8. These COX-1 ligand molecules were identified 
25 in two screens against small libraries, and their single 
molecule formulations are in preparation for further 
testing. Furthermore, many of these hits are observed to 
compete with known COX-1 ligand, meclof enamate , for binding 
(Figure 9) . 

30 

EXAMPLE 2: IDENTIFICATION OF LIGAND BINDING TO m2 mAChR 

PROTEIN BY MASS SPECTROSCOPY 

A gene construct encoding the m2 subtype of the 

muscarinic acetylcholine receptor (m2R) was cloned into a 

35 baculovirus expression vector according to conventional 
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cloning methods (see e.g., Baculovirus Expression Vector 
System , 6th Edition, 1999, Pharmingen, San Diego, CA) . The 
gene construct encoded a polypeptide with an amino terminal 
methionine followed immediately in frame by the melittin 
signal sequence (SEQ ID NO: 12) followed immediately in frame 
by the FLAG Ml epitope tag (SEQ ID NO:l) followed 
immediately in frame by the sequence for the m2 muscarinic 
acetylcholine receptor (NCBI Accession No. X04708) . The 
full-length polypeptide sequence therefore was: 

NH2- 

MKFL VNVAL VFM WY I S Y I YAD YKDDDD KMMNNS TNS SNS GLALT S PYKT 
F EWP I VLVAG S L S L VT 1 1 GN I LVMVS I KVNRHLQT VNN YFL F S LAC ADL 
1 1 GVFSMNL YTL YTVI GYWPLGP WCDLWLALDYWSNAS VMNLL IIS FD 
RYFCVTKPLTYPVKRTTKMAGMMIAAAWVLS F I LWAPAI LFWQF I VGVRT 
VEDGECYIQFFSNAAVTFGTAIAAFYLPVI IMTVLYWHI SRASKSRIKKD 
KKE P VANQE P VS P S L VQGR I VKPNNNNMPGS DE ALEHNKI QNGKAPRD AV 
TENCVQGEEKESSNDSTSVSAVASNMRDDEITQDENTVSTSLGHSKDENS 
KQTCIKIVTKTQKSDSCTPANTTVELVGSSGQNGDEKQNIVARKIVKMTK 
QPAKKKPPPSREKKVTRTILAILLAFI ITWAPYNVMVLINTFCAPCI PNT 
VWTIGYWLCYINSTINPACYALCNATFKKTFKHLLMCHYKNIGATR- 
COOH (SEQ ID NO: 34) 

Upon expression, the mellitin signal sequence is cleaved after Ala(21) revealing 
an amino terminal FLAG epitope which is bound specifically by the FLAG Ml antibody 
resin (Sigma; St. Louis, MO). This baculovirus expression vector was used to generate 
baculovirus that directed the expression of the above polypeptide in insect cells according 
the conventional methods (Baculovirus Expression Vector System, 6th Edition, 1999, 
Pharmingen, San Diego, CA). 

To purify FLAG-tagged m2R, 60 g of insect cells expressing the above 

polypeptide were suspended in 0.6 L of TBS [50 mM Tris-CL, pH 7.4, 100 mM NaCl), 

and the sample was homogenized by nitrogen cavitation. The homogenate was subjected 

to centrifugation at 500 x g for 30 minutes at 4° C to removed non-homogenized cells. 

The pellet was discarded and the supernatant was subjected to ultracentrifugation at 
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100,000 x g for 45 minutes at 4° C. The ultracentrifugation supernatant was discarded 
and the pelleted cell membranes were resuspended in TBS containing 0.5% (w/v) 
digitonin (TBS-D buffer) to a protein concentration of 2.5 mg/mL. This suspension was 
incubated, stirring for 60 minutes at 4°C before ultracentrifugation at 100,000 x g for 45 
minutes at 4° C to pellet insoluble material. The soluble supernatant was applied to a 5 
rriL column of the FLAG Ml antibody resin pre-equilibrated with TBS-D buffer at a flow 
rate of 0.7 mL/min for antibody affinity purification. After loading the soluble material 
onto the FLAG Ml antibody resin, the column was washed with 50 mL of TBS-D buffer 
and then the FLAG-tagged m2R protein was eluted from the column with TBS-D buffer 
containing 100 pg/rnL of FLAG peptide (Sigma; St. Louis, MO). Eluted column 
fractions containing purified FLAG-tagged m2R were identified by SDS-PAGE. 

To assess the concentration of purified FLAG-tagged m2R protein that was 
capable of binding muscarinic ligands, glass fiber filter-binding assays were performed 
according to the method of Peterson (Peterson, G.L., et al (1995) J. Biol. Chem. 
270:17808). 

This m2R preparation consisted of 6 p_M m2R in TBS-D with 100 TJg/mL of 
FLAG peptide. Each m2R polypeptide is reversibly associated with a multiplicity of 
digitonin molecules, creating a membrane protein-detergent complex with a 
stoichiometry of m2R:digitonin of 1:5-500. This preparation was designated as Stock 
m2R for use in ALIS sample preparation according to the sample preparation protocol 
outlined below. 

Other stock reagents were prepared. Stock cyclooxygenase 1 (COX) was 
prepared according to the method in Example 1 to yield a concentration of 6 uM COX in 
TBS. Stock discrete ligands (Sigma; St.Louis, MO) pirenzepine, quinuclidinyl benzylate 
(QNB), and atropine were prepared to 400 pM in TBS. Four combinatorial chemical 
libraries were prepared in dimethyl sulphoxide (DMSO) constituting on average 2500- 
5000 small drug-like molecules each at a concentration of 400 pM. These four drug 
libraries were designated NMG-66, NGM-41, NGL-10-A-41, and NGL-1 16-A-470. Four 
stock test libraries were prepared containing 400 pM atropine by adding atropine 
dissolved in DMSO to the individual drug libraries mentioned above to yield Stock 
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NMG-66 Plus Atropine, Stock NGM-41 Plus Atropine, Stock NGL-10-A-41 Plus 
Atropine, and Stock NGL-1 16-A-470 Plus Atropine. Four stock test libraries were 
prepared containing 400 pM QNB by adding QNB dissolved in DMSO to the individual 
drug libraries mentioned above to yield Stock NMG-66 Plus QNB, Stock NGM-41 Plus 
QNB, Stock NGL-10-A-41 Plus QNB, and Stock NGL-1 16-A-470 Plus QNB. Premix 
Buffer was prepared by combining 100 pL of 5% digitonin, with 4.6 mL water, and 400 
pL 1 M Tris-Cl, pH 7.5, then equilibrated at 42°C. 

Binding reactions were prepared that combined protein (either m2R test protein or 
COX control protein) with ligand (either QNB alone, atropine alone, pirenzepine alone, 
atropine plus drug library, or QNB plus drug library) or protein with DMSO as a control. 
In each case, 38 pL of Premix Buffer was dispensed into polypropylene tubes containing 
2 pL of DMSO or DMSO-solubilized ligands, mixed by vortexing, and centrifuged at 
8,000 x g for 10 minutes at room temperature to remove insoluble material. The clarified 
supernatants (2 pL) containing either aqueous DMSO alone or aqueous DMSO- 
solubilized ligands (with or without drug libraries) were transferred to polypropylene 
tubes at 4° C. Target protein (8 pL) m2R or control (8 pL) protein COX was added to 
the supernatants, mixed well by pipetting, and incubated at 4° C for 60 minutes. 

These binding reaction preparations were then subjected to ALIS analysis. The 
binding reaction preparations combine 4.8 pM target membrane protein (m2R) or 4.8 pM 
control membrane protein (COX) with a multiplicity of approximately 2500 small drug 
like molecules each at a concentration of 1-10 pM in a manner that established 
equilibrium binding conditions. High affinity ligands (K id <100 pM) to the proteins are 
then identified by ALIS. Small molecule ligands of the target protein m2R that do not 
also bind to the control protein COX are considered as specific ligands of the target 
protein. Separately, for comparison to experiments with a multiplicity of drug molecules, 
binding reactions were also prepared combining m2R or COX with individual (discrete) 
m2R ligands. This ALIS Analysis proceeded as described below with a series of size 
exclusion chromatography (SEC), followed by reverse phase chromatography (RPC), 
followed by mass spectrometric (MS) analysis. 
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These prepared binding reaction mixtures were individually subjected to ALIS 
Analysis. Ligands that bound to the m2R with suitably high affinity were collected with 
the protein-containing SEC fraction as follows. The large detergent-solubilized 
molecules were separated from the unbound small drug-like molecules by SEC over a 4.6 
mm x 50 mm x 5 \im SEC column at 4°C using a running buffer of 50 mM Tris-Cl, pH 
7.5, 150 mM NaCl, 2.5% DMSO at a flowrate of 2 mL/minute. The protein containing 
fraction was identified with ultraviolet electronic absorption spectrometry monitoring at 
230 nm. 

The protein-containing fraction was transferred by way of a sample loop to a low 
flowrate (100 uL/min RPC system. The RPC column (Higgins C-18; 1 mm x 50 m x 5 
um) is maintained at 60 ° C to promote ligand dissociation from the complex. From this 
RPC column the ligand in eluted into a high-resolution mass spectrometer for analysis 
using a gradient of 5%-95% acetonitrile (0.1% formic acid counterion) in water (w/ 0.1% 
formic acid) over 5 minutes. Mass analyzed ligands collected with the protein-containing 
SEC fraction by virtue of its high affinity for the protein-detergent m2R complex were 
identified by fore-knowledge of their precise mass. 

Experiments demonstrated that m2R screened by ALIS Analysis in this manner 
enabled known m2R ligands to be extracted from mixtures of a multiplicity of small drug 
like molecules by virtue of the known m2R ligands' high affinity for the m2R-detergent 
complex. This ALIS -formatted screen recovered m2R ligands from drug libraries just as 
well as it recovered m2R ligands bound to the m2R-detergent complex in the absence of 
drug libraries (Figure 10). 

Data for these experiments are presented in Figure 10. In these experiments, 
equilibrium binding reactions were established by combining 2.0 \iM m2R with 10 uM 
pirenzepine, QNB, or atropine in the absence or presence of combinatorial drug libraries 
(NGM-66, NGM-340, NGL-10-A-41, NGL-1 16-A-470) that presented approximately 
2500 small drug-like molecules each at a concentration of 1-10 uM. The binding 
reactions were subjected to ALIS Analysis and the extent of ligand recovery was 
quantified by the signal strength of the mass spectrometer. The x-axis is in relative units 
of mass spectrometric signal response for the respective masses of pirenzepine, QNB, and 
atropine. 
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EXAMPLE 3: DISPLACEMENT ASSAY IN AN AFFINITY-BASED 

SELECTION AND ALIS IDENTIFICATION OF HP 
LIGANDS 

As a representative HP, m2 mAChR is purified according 
to the method of Peterson et al . (Peterson, G.L. et al . 
(1995) J". Biol. Chem. 270: 17808). The target protein is 
adjusted to a concentration of 20 [iM in a buffer of TBS-AGD 
and is incubated with a known muscarinic ligand, pirenzepine 
(MW = 424.3), which is at a concentration of 1 |iM. As a 
control test, an unrelated membrane protein, glycophorin, is 
also prepared in TBS-AGD at 20 \3M and is incubated with the 
compound pirenzepine (MW = 424.3), which is at a 
concentration of 1 |J,M. 

A library of mass-coded compounds is added to the m2 
mAChR/pirenzepine mixture, and the sample is analyzed to 
determine if a library compound displaces pirenzepine from 
the HP protein. Displacement of pirenzepine indicates that 
the library compound binds more tightly and at the same site 
as pirenzepine itself. 

The displacement assay is conducted as follows. The 
mixture of m2 mAChR, 1 |j,m pirenzepine, and 1 U-M of each 
library compound are subjected to ALIS Analysis. In this 
analysis MS signal corresponding to the mass of pirenzepine 
is monitored, while the mass of the library compounds are 
ignored. If the library compound quantitatively reduces the 
MS signal corresponding to the mass of pirenzepine, it is 
inferred that the library compound displaced pirenzepine 
from the HP protein, in this case mAChR protein. If 
incubation with the library compounds does not alter the MS 
signal corresponding to the mass of pirenzepine, the library 
compounds are considered not to contain an m2 mAChR ligand. 
By comparing pirenzepine MS signal with and without 
incubation with the library compounds, one can assess 
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whether the library compound displaces pirenzepine and thus 
binds to the mAChR protein. 



EXAMPLE 4: AFFINITY SELECTION OF m2 mACHR MASS-CODED 

LIGANDS AND IDENTIFICATION BY ALIS 

As a representative HP, m2 mAChR is purified according 

to the method of Peterson et al . (Peterson, G.L. (1995) J. 

Biol. Chem 270: 17808). The protein is adjusted to 20 uM 

in a buffer of TBS-AGD is incubated with a 2500 -member 

library of mass-coded compounds, each member at a 

concentration of 1 uM. After a 3 0 minute incubation at 

22°C, the sample was chilled at 4°C pending ALIS analysis. 

As a control test, an unrelated membrane protein, 

glycophorin, is also prepared in TBS-AGD at 2 0 |_iM and 

incubated with the mass-coded library. To determine if the 

compound specifically binds to the mAChR, the mAChR -compound 

mixture is analyzed by ALIS Analysis. If a mass 

corresponding to one of the members of the mass-coded 

library appears when the protein peak is collected and 

surveyed by MS, that compound may be identified as a binding 

ligand. if the same compound fails to appear in the 

glycophorin control experiment, the compound may then be 

identified as a specific ligand of the mAChR. By virtue of 

the mass encoding the precise combination of building blocks 

and core can be identified (see U.S. Patent No. 6,147,344 by 

Annis et al.). Using MS-MS analysis (see U.S. Patent No. 

6,14 7,344 by Annis et al.), the exact structure of the core 

plus building block combination can also be pinpointed. 



EXAMPLE 5: AFFINITY SELECTION OF COX-1 LIGANDS AND 

IDENTIFICATION BY MODIFIED ALIS WITH ON-LINE 
FLUORESCENCE DETECTION 

As a representative HP, COX-1 protein was purified from 
ram seminal vesicles according to the method of Johnson et 
al. (Johnson, J.L. (1995) Arch. Biochem. Biophys. 324:26- 
34) . The COX-1 sample is adjusted to 20 [J,M in TBS-AGD (50 
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mM tris, pH 8.0, 150 mM NaCl, 800 mM dodecyl p-D-maltoside , 
2.5% DMSO) is mixed and incubated with a 2500-member library 
of mass-coded compounds, each member at a concentration of 1 
|i.M. After a 30 minute incubation at 22°C, the sample was 
5 chilled at 4°C pending ALIS analysis. As the total volume 
is 12 (J.L, the sample thus contains 24 0 pmol protein and 12 
pmol of each library component. As a control test, an 
unrelated membrane protein, glycophorin, is also prepared in 
TBS-AGD at 2 0 and incubated with the same 2 50 0-member 

. ,10 library and treated similarly. 

Then the mixtures are individually subjected to 
modified ALIS analysis as follows. The large detergent- 
„j solubilized protein is separated from the small drug-like 
j:| molecules by size exclusion chromatography (SEC) over a 
4.6mm x 50 mm x 5 |0.m SEC column at 0°C using a running 
I buffer of TBS (50 mM tris, pH 8.0, 150 mM NaCl, 2.5% DMSO) 
at a flowrate of 2 mL/minutes. The eluting SEC fraction 
*^ containing protein is identified by on-line fluorescence 
q detection exciting at 240-250 nm and monitoring emission at 
|«20 340 nm and transferred by way of a sample loop to a low-flow 
(100 |J,L/minute) reverse-phase chromatography (RPC) system. 
The RPC column (Higgins C-18; 1 mm x 50 mm x 5 \im) is 
maintained at 60°C to promote dissociation of ligands from 
the complex. From the RPC column, the ligand is eluted into 
25 a high-resolution mass spectrometer for analysis using a 
gradient of 5%-95% acetonitrile (0.1% formic acid 
counterion) in water (w/ 0.1% formic acid) over 5 minutes. 
Ligand of suitably high affinity bound to the COX-1 at the 
time its fraction is collected, and the mass of the ligand 
30 is identified by mass spectral analysis. 

Through the mass coding, the precise combination of 
building blocks and core molecule are identified (see U.S. 
Patent No. 6,147,344 by Annis et al . ) . If the same compound 
fails to appear in the glycophorin control experiment, the 
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compound may then be identified as a specific ligand of the 
COX-1 protein. 



EXAMPLE 6: AFFINITY SELECTION OF COX-1 LIGANDS AND 

IDENTIFICATION BY MODIFIED ALIS WITH ON-LINE 
LIGHT SCATTERING DETECTION 

As a representative HP, COX-1 protein was purified from 
ram seminal vesicles according to the method of Johnson efc 
al. (Johnson, J.L. (1995) Arch. Biochem. Biophys. 324:26- 
34) . The COX-1 sample is adjusted to 20 \iM in TBS-AGD (50 
mM tris, pH 8.0, 150 mM NaCl , 800 mM dodecyl (3-D-maltoside , 
2.5% DMSO) is mixed and incubated with a 25 0 0 -member library 
of mass-coded compounds, each member at a concentration of 1 
\iM. After a 30 minute incubation at 22°C, the sample is 
chilled at 4°C pending modified ALIS analysis. As the total 
volume is 12 |iL, the sample contains 240 pmol protein and 12 
pmol of each library component. As a control test, an 
unrelated membrane protein, glycophorin, is also prepared in 
TBS-AGD at 2 0 (J.M and incubated with the same 2500 -member 
library and treated similarly. 

Next, the control and test mixtures are individually 
subjected to modified ALIS analysis as follows. The large 
detergent-solubilized protein is separated from the small 
drug-like molecules by size exclusion chromatography (SEC) 
over a 4.6mm x 50 mm x 5 |J,m SEC column at 0°C using a 
running buffer of TBS (50 mM tris, pH 8.0, 150 mM NaCl, 2.5% 
DMSO) at a flowrate of 2 mL/minute . The eluting SEC 
fraction containing protein is identified by on-line light 
scattering detection and transferred by way of a sample loop 
to a low- flow (10 0 p,L/minute) reverse-phase chromatography 
(RPC) system. The RPC column (Higgins C-18; 1 mm x 50 mm x 
5 \xm) is maintained at 6 0°C to promote dissociation of 
ligands from the complex. From the RPC column, the ligand 
is eluted into a high-resolution mass spectrometer for 
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analysis using a gradient of 5%-95% acetonitrile (0.1% 
formic acid counterion) in water (w/ 0.1% formic acid) over 
5 minutes. Ligand of suitably high affinity bound to the 
COX-1 protein at the time its fraction is collected, and the 
mass of the ligand is identified by mass spectral analysis. 

Through the mass coding, the precise combination of 
building blocks and core molecule are identified (see U.S. 
Patent No. 6,147,344 by Annis et al . ) . If the same compound 
fails to appear in the glycophorin control experiment, the 
compound may then be identified as a specific ligand of the 
COX-1 protein. 

EXAMPLE 7: HETEROGENEOUS SOLUTION PHASE SCREENING FOR 

LIGANDS THAT BIND m2 mACHR : IDENTIFICATION 
USING AFFINITY SELECTION AND ALIS ANALYSIS 

HP ligands may also be screened by utilizing a 
heterogeneous solution phase screening method in which a 
tagged target sequence is immobilized on a solid support. 
For example, anti-flag antibody- loaded protein A agarose 
beads (anti-flag beads) are prepared for use as a 
sedimentable stationary element in an immunoprecipitation 
(IP) -based screening protocol. 

Briefly, using a buffer of TBS -AG (50 mM tris, pH 8.0, 
150 mM NaCl, 800 mM dodecyl P-D-maltoside) , 100 |iL of 50% 
v/v slurry of protein A agarose (Santa Cruz Biotechnology, 
St. Louis, MO.) are washed in a 1.5 mL eppendorf tube with 
three room temperature cycles of: (1) combining the beads 
with 1 mL TBS -AG; (2) mixing the sample by tumbling for 20 
minutes; and (3) centrif ugation at lOOOOxg, followed by a 
careful removal of the supernatant that leaves the pelleted 
agarose beads in the bottom of the tube. 

After the beads have been washed, the 50 |iL pellet of 
beads is brought up in 1.0 mL TBS -AG. To that mixture, 50 
|IL of 1 mg/mL (3-galactosidase in TBS -AG buffer is added and 
the mixture is incubated at 4°C for 60 minutes to block non- 
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specific binding of protein to the beads. To that mixture 
is added 10 |J,L of 1.0 |ig/mL anti-flag antibody (Sigma, St. 
Louis, Montana) , and the mixture is incubated at 4°C for 6 0 
minutes. In parallel, another preparation of control 

agarose beads handled similarly is treated with 10 |iL of 
TBS -AG instead of the anti-flag antibody. The anti-flag 
loaded beads and the control beads are then washed to remove 
excess antibody and protein and, finally, resuspended in 
0.10 mL of TBS -AG buffer and transferred to 0.5 mL eppendorf 
tubes . 

CHO cells expressing an m2 mAChR-flag tag-His tag 
protein are cultured, lysed, and homogenized. The m2 mAChR - 
containing membranes of the cells are purified by sucrose 
step-gradient ul tracentrif ugation . This sucrose step- 

gradient ultracentrifugation step removes most non-membrane 
proteins and cell debris. The ultracentrifugation step also 
has the potential of isolating specific populations of 
membranous cellular substructures. The mAChR- enriched 

membranes are solubilized with detergent (dodecyl-p- 
maltoside, DpM or CYMAL-7; (Anatrace; Maumee, OH)) and 
subjected to three steps of affinity purification. 

First, metal chelate affinity chromatography (MCAQ will 
be used to take advantage of the polyhistidine- tagged C- 
terminus, followed by a second affinity purification, an 
antibody affinity purification based on the FLAG-tagged N- 
terminus (Kobilka, B.K. (1995) Anal. Biochem. 231: 269). 
Third, ligand affinity purification over a column of 
immobilized mAChR ligand 3 - ( 2 ' -aminobenzhydryloxy) - tropane 
(ABT) , followed by a desalting step will yield a final 
enrichment of active m2 mAChR protein. 

The sample is adjusted to 20 m2 mAChR protein in a 

buffer of TBS-AGD and is incubated with a 2500 -member 
library of mass -coded compounds, each member at a 
concentration of 1 \iM. The reaction is done in a volume of 
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40 \ih. After a 30 minute incubation at 22°C, the sample is 
chilled at 4°C pending MS analysis. As a comparison test, 
P-galactosidase is similarly prepared in TBS-AGD at 20 \xM 
and incubated with the mass-coded library. 

The m2 mAChR protein -compound mixtures are prepared for 
analysis by MS analysis with the following procedure. The 
purified protein-library mixtures, either the m2 mAChR 
protein or P-galactosidase protein trials, are each split 
into two 2 0 U-L volumes. For the m2 mAChR- library mixture, 
one volume is combined with the anti-flag beads (anti- 
flag/protein A agarose ) for IP and the other volume is 
subjected to a mock IP by combination with the control 
agarose beads (buffer/protein A agarose beads) . Similarly, 
for the P-galactosidase mixture, one 20 uh volume is 
combined with the anti-flag beads for a control IP and the 
other 2 0 |iL volume is subjected to a mock IP by combination 
with the control agarose beads. These IPs proceed, mixed by 
tumbling, for 60 minutes at 4°C. Afterwards, each IP is 
washed with three room temperature cycles of: (1) combining 
the beads with 1 mL TBS -AG, (2) mixing by tumbling for 2 
minutes, and (3) centrif ugation at 10,000 g, followed by a 
careful removal of the supernatant that leaves the pelleted 
agarose beads in the bottom of the tube. Finally, each 50 
[iL, bed volume bead preparation is then resuspended in an 
additional 50 U-L of TBS (50 mM tris, pH 8.0, 150 mM NaCl) 
and kept at 4°C. At the completion of this process, four 
heterogeneous bead preparations are made: (1) m2 mAChR/anti- 
flag/protein A agarose (m2 beads) , (2) m2 

mAChR/buf f er/protein A agarose beads (m2 control beads) , (3) 
p-galactosidase/anti-f lag/protein A agarose beads (P~lac 
beads) , and (4) P-galactosidase/buf f er/protein A agarose 
beads (p-galactosidase control beads) . 
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These bead-based preparations constitute heterogeneous 
solution phase systems useful for screening in the following 
manner. Each 100 oh bead preparation is combined with 50 uL 
of 30% acetonitrile in water heated to 60°C for 5 minutes to 
dissociate bound ligands from the protein. Then the mixture 
is centrifuged at 10, 000 x g to pellet the beads, and the 
dissociated protein in the supernatant is transferred to a 
new tube . 

The IP procedure allows both an affinity selection of a 
ligand and the physical separation of the m2mAChR- ligand 
complex from unbound library members. These supernatants 
contain 50 mM tris, pH 8.0, 150 mM NaCl , 80 mM dodecyl p-D- 
maltoside, and 10% acetonitrile. Approximately 60 uL of 
these supernatant samples are injected individually into a 
low- flow (100 u-L/minutes) reverse-phase chromatography (RPC) 
system. The RPC column (Higgins C-18; 1 mm x 50 mm x 5 |J,m) 
is maintained at 60°C. From this RPC column, the ligand is 
eluted into a high-resolution mass spectrometer for analysis 
using a gradient of 5%-95% acetonitrile (0.1% formic acid 
counterion) in water (w/ 0.1% formic acid) over 5 minutes. 

m2 mAChR- ligands are identified by observing a mass 
corresponding to one compounds from the 25 0 0 -member mass- 
coded library in the supernatant of the m2mAChR protein 
beads and not from the supernatant of the m2mAChR protein 
control, p-lac library, or p-lac control beads. The 
structure of the mass -coded compounds selected in the 
procedure are easily identified (see U.S. Patent No. 
6,147,344 by Annis et al . ) . Using MS-MS analysis (see U.S. 
Patent No. 6,147,344 by Annis et al.), the exact structure 
of the core plus building block combination can also be 
pinpointed. 

EXAMPLE 9: HETEROGENEOUS SOLUTION PHASE SCREENING FOR 

LIGANDS THAT BIND m2 mACHR: IDENTIFICATION 
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USING A DISPLACEMENT ASSAY COMBINED WITH 
AFFINITY SELECTION AND MS ANALYSIS 

Flag-tagged m2 mAChR protein is purified as outlined 

herein above and resuspended at a concentration of 2 0 |iM in 

a buffer of TBS-AGD. The protein is incubated with a 2500- 

member library of mass -coded compounds, each member at a 

concentration of 1 |j.M and with 1 |j,M pirezepine, a known 

ligand for m2 mAChR protein, in a volume of 4 0 |J,L. After a 

30 minute incubation at 22°C, the sample was chilled at 4°C 

pending MS analysis. As a comparison test, (3- galactosidase 

is similarly prepared in TBS-AGD at 20 \xM and incubated with 

the mass-coded library and pirezepine. 

The purified protein-library-pirenzepine mixtures, from 

either the m2 mAChR protein or p-galactosidase trials, are 

each split into two 20 |J,L volumes. For the m2 mAChR- 

library-pirenzepine mixture, one volume is combined with the 

anti-flag beads (anti-flag/protein A agarose) for IP and the 

other volume is subjected to a mock IP by combination with 

the control agarose beads (buffer/protein A agarose beads) . 

Protein A agarose beads are prepared as previously described 

herein. Similarly, for the p-galactosidase mixture, one 20 

|J.L volume is combined with the anti-flag beads for a control 

IP and the other 2 0 |j,L volume is subjected to a mock IP by 

combination with the control agarose beads. These IPs 

proceed, mixed by tumbling, for 6 0 minutes at 4°C. Then 

each IP is washed with three room temperature cycles of: (1) 

combining the beads with 1 mL TBS -AG; (2) mixing by tumbling 

for 2 minutes; and (3) centrif ugation at 10,000 x g, 

followed by a careful removal of the supernatant that leaves 

the pelleted agarose beads in the bottom of the tube. 

Finally, each 50 |J,L bed volume bead preparation is then 

resuspended in an additional 5 0 |J,L of TBS (50 mM tris, pH 

8.0, 150 mM NaCl) and kept at 4°C. At the completion of 



58 

BOSTON 1318564vl 



Express Mail Label No.: EM300819915US 
Date of Deposit: December 19, 2001 



PATENT 

CASE NO. 111025.173; NG-016US 

this process, four heterogeneous bead preparations are made: 
(1) m2 mAChR/anti-f lag/protein A agarose (ra2 beads) ; (2) m2 
mAChR/buf f er/protein A agarose beads (m2 control beads); (3) 
(5-galactosidase/anti-f lag/protein A agarose beads (|3- 
galactosidase beads), and (4) P-galactosidase/buf f er/protein 
A agarose beads (p-galactosidase control beads) . 

These bead-based preparations constitute heterogeneous 
solution phase systems useful for screening in the following 
manner. Each 100 |J,L bead preparation is combined with 50 mL 
of 30% acetonitrile in water heated to 60°C for 5 minutes to 
dissociate any bound pirenzepine from the protein. Then the 
mixture is centrifuged at 10,000 x g to pellet the beads, 
and dissociated protein in the supernatant is transferred to 
a new tube. The IP procedure allows both an affinity 
selection of the ligand and the physical separation of the 
m2 mAChR-ligand complex from unbound library members. These 
supernatants contain 50 mM tris, pH 8.0, 150 mM NaCl, 80 mM 
dodecyl p-D-maltoside , 10% acetonitrile, and <50 pmol of 
pirenzepine. Approximately 60 [iL of these supernatants are 
injected into low- flow (100 |J.L/minute) reverse-phase 
chromatography (RPC) system. The RPC column (Higgins C-18; 
1 mm x 50 mm x 5 |im) is maintained at 60°C. From this RPC 
column, the ligand is eluted into a high-resolution mass 
spectrometer for analysis using a gradient of 5%-95% 
acetonitrile (0.1% formic acid counterion) in water (w/ 0.1% 
formic acid) over 5 minutes. 

The mass corresponding to pirenzepine from the 
supernatant of the m2 mAChR protein control, p- 
galactosidase library test, or P-galactosidase control beads 
should be higher than the pirenzepine mass response in the 
MS analysis of the m2 mAChR protein beads. It is then 
inferred that the 2500-member mass-coded library contains a 
ligand that binds to the m2 mAChR protein with an affinity 
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greater than that of pirenzepine. Libraries that contain 
hits can thus be detected and selected from among other 
libraries that do not contain hits. 



EXAMPLE 10: ENHANCING THE HYDROPHOBIC PROTEIN 

SCREENING SUCCESS BY MULTIPLEXING 

To maintain the amphiphile-solubilized HP in a three- 
dimensional conformation that enhances the success of 
screening, HPs are multiplexed in the preparation of 
screening samples. Multiplexing is defined to mean any 
method of preparation wherein the target protein is combined 
with some known molar equivalent of one or more accessory- 
proteins (APs) . Five independent criteria are identified to 
guide the selection of most favorable screening conditions 
with regard to the use of APs : (1) in the presence of the 
AP(s), the target HP is observed to bind a known agonist 
with greater affinity than without the AP(s) present; (2) in 
the presence of the AP(s), the target HP is observed to bind 
a known antagonist with greater affinity than without the 
AP(s) present; (3) in the presence of the AP(s), the HP is 
shown to have greater functional activity as assayed by 
enzymatic, in vitro, or cell -based assays,- (4) in the 
presence of the AP(s), the HP is shown to alter its state of 
multimerization; and (5) in the presence of the AP(s), the 
HP is shown to have greater conformational stability or 
uniformity. 

As a first example, a method for multiplexed screening 
of m2 mAChR with G ail p lY2 -Guanosine -diphosphate (GDP) as a 
heterotrimeric AP is presented. The G Kil p lY2 - - GDP complex was 
identified as an AP for the m2 mAChR because its presence 
enhanced the affinity of the m2 mAChR for an antagonist, N- 
methylscapolamine (NMS) by 5-fold as demonstrated by the 
following method. 3 H-NMS binding assays performed by the 
method of Rinken and Haga (Rinken, A. and Haga, T. (1993) 
Arch. Biocehm. Biophys . 301:158-164) showed that the 
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apparent Kd of m2 mAChR for 3H-NMS was 0.17 nM while Kd of 
the m2 mAChR- G H . lPlY2 -GDP complex was 432 nM. To screen this 
HP/AP complex, m2 mAChR/ G ailPlY2 - PLs were prepared as 
described in Example 11 and screened for ligand binding 
according to any of the methods utilizing affinity selection 
and ALIS or modified ALIS presented herein with the addition 
of 5 0 uM GDP to the binding buffer at the stage of 
incubation with the ligand compounds. 

As a second example, multiplexed screening of 
heterodimerized k and 6 subtypes of the human opioid 
receptor is presented. The human K opioid receptor was 
identified as an AP for the human 8 opioid receptor because 
its presence caused the multimerization state of the human 8 
opioid receptor to change from monomer to heterodimer as 
demonstrated by the method of Jordan and Devi (Jordan, B.A. 
and Devi, L.A., (1999) Nature 399:697-708). 

To prepare the 8/k opioid receptor heterodimer complex 
for screening, the following method is used. Human K-opioid 
receptor is purified and dialyzed into TBS -AG at a 
concentration of 50 UM. Human 5-opioid receptor is purified 
and dialyzed into TBS -AG at a concentration of 50 \xM. Equal 
volumes of these two preparations were combined to allow 
them to dimerize. 

The heterodimerized opioid receptor complex is then 
screened for ligand binding according to any of the methods 
utilizing affinity selection and ALIS or modified ALIS 
presented herein. 

EXAMPLE 11: METHODS FOR THE PREPARATION OF DEFINED 

GPCR PROTEOLIPOSOMES (GPCR-PL) 

To add conformational stability to detergent - 
solubilized GPCRs, the detergent micelle solubilization is 
exchanged for solubilization in defined proteoliposomes 
meeting the following criteria: (1) the lipid composition of 
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the PL is defined and characterized as having no more than 
six discrete lipid entities making up 90% of the lipid 
content of the PL; (2) 90% of the HP-PL preparation has a 
defined, unimodal size distribution of particles spanning no 
more than three orders of magnitude and wherein mean 
particle size is in the range of 5-10000 nm; (c) the PL 
preparation yields >50 nM HP. 

To prepare HP- PL bearing m2 mAChR protein, the 
following procedure is used. Synthetic lipids (Avanti Polar 
Lipids, Alabaster, AL) D-ribo-phytospingosine-l-phosphate 
and ceramide-C18 : 0 each in chloroform are combined in a 
volume of 1 mL each at 5 raM in a glass test tube, and the 
chloroform is evaporated at room temperature under a stream 
of argon for 24 hrs . The lipid residue is then wetted with 
0.4 0 mL of TBS, sonicated for 3 0 minutes at 2 8°C, and the 
mixture is transferred to a 1.5 mL polypropylene tube. To 
this lipid mixture is added 0.2 mL 50 |J,M m2 mAChR prepared 
as described in Peterson, G.L. et al . (Peterson, G.L. et al . 
(1995) J. Biol. Chem. 270: 17808). This yields a crude 
mixture of lipid and protein. 

The crude protein-lipid mixture is then incubated for 3 
hours at 8°C with 5 minute, 22°C bath sonication at 3 0 minute 
intervals (6 times) . This mixture is then subjected to 11 
passages through a 100 nm polycarbonate membranes in a small 
volume extruder according to the manufacturer's protocol 
(Avanti Polar Lipids, Alabaster, AL) at 20°C. This yields a 
crude PL preparation of 0.6 mL volume, 16 |lM m2 mAChR, and 
excess amphiphile (DbM and lipid) . To remove excess lipid 
and detergent, the crude PL preparation is passed through a 
10.0 mL desalting (G-50 sephadex column) previously 
equilibrated with TBS at 4°C. After application of the 
crude PL preparation to the desalting column, TBS at 4°C is 
used to elute the material from the column. This desalting 
procedure is conducted with a flow rate of 0.5 mL/minute. 
The protein concentration is estimated by combining a 10 |_iL 
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sample with 10 |iL of 0.2% Triton X-100 and using that 
mixture in a Bio-Rad colorimetric protein assay (Bio-Rad 
Inc., Hercules, CA) . The concentration is then adjusted by 
dilution with TBS to 10 \iM. 

The size distribution is defined using a Coulter N4 
subraicron particle analyzer following the manufacturer's 
protocols. An independent assessment of size distribution 
and mean particle size is accomplished by analytical SEC by 
injecting 20 U-L onto an analytical SEC column with a running 
buffer TBS at a flow rate of 1.0 mL/minute (G2000 SWXL , 5 x 300 
mm, 5 urn; ToSo-Haas) , fitted with a UV detector monitoring 
at 280 nm, and chilled to 12°C. A single peak eluting at 
8.23 minutes identified the retention time of the GPCR-PL' s 
which was compared to standard curve of molecular size 
marker elution times and indicated that the mean apparent 
size was comparable to that of a 940,000 dalton protein. 

Proteoliposomes may also be prepared with HPs complexed 
with APs. In this procedure, detergent micelle 

solubilization is exchanged with solubilization in defined 
proteoliposomes meeting the following criteria: (1) the 
lipid composition of the PL was defined and characterized as 
having no more than six discrete lipid entities making up 
90% of the lipid content of the PL/ (2) 90% of the HP-PL 
preparation has a defined, unimodal size distribution of 
particles spanning no more than three orders of magnitude 
and wherein mean particle size is in the range of 5-10000 
nm; (3) the PL preparation yields >5 0 nM HP; (4) the PL 
preparation included at least one other protein designated 
the accessory protein (AP) , the presence of which supports 
the maintenance of a favorable conformation for the target 

HP. 

To prepare HP-PL bearing m2 mAChR, the following 
procedure was used. Synthetic lipids (Avanti Polar Lipids, 
Alabaster, AL) D-ribo-phytospingosine-l-phosphate and 
ceramide-C18:0 each in chloroform are combined in a volume 
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of 1 mL each at 5 mM in a glass test tube and the chloroform 
is evaporated at room temperature under a stream of argon 
for 24 hrs. The lipid residue is then wetted with 0.40 mL 
of TBS plus 2 mM MgCl 2 (TBSM) , sonicated for 3 0 minutes at 
2 8°C, and mixture transferred to a 1.5 mL polypropylene 
tube. Separately, 50 uM G ail p lY2 was prepared by combining 
equal volumes of 100 |1M G ail with 100 uM Gp lY2 purified as 
described (ref) and dialyzed into TBSM plus 800 mM DbM 
( TBSM -AG) . To the lipid mixture is added 0.2 mL 50 uM m2 
mAChR prepared as described in Peterson, G.L. et al . 
(Peterson, G.L. et al . (1995) J. Biol. Chem. 270: 17808) and 
0.2 mL 50 uM G ail p lY2 . To this mixture is added 10 uL of 50 mM 
GDP. This yields a crude mixture of lipid, HP, and AP. 

The crude HP/AP- lipid mixture is then incubated for 3 
hours at 8°C with 5 minute, 22°C bath sonication at 3 0 minute 
intervals (6 times) . This mixture is then subjected to 11 
passages through a 100 nm polycarbonate membranes in a small 
volume extruder according to the manufacturer's protocol 
(Avanti Polar Lipids, Alabaster, AL) at 20°C. This yields a 
crude PL preparation of 0.8 mL volume, 12 uM mAChR, 12 uM 
G anPiY2' an d excess amphiphile (DbM and lipid) . To remove 
excess lipid and detergent, the crude PL preparation is 
passed through a 10.0 mL desalting (G-50 sephadex column) 
previously equilibrated with TBS at 4°C. After application 
of the crude PL preparation to the desalting column, TBS at 
4°C is used to elute the material from the column. This 
desalting procedure is conducted with a flow rate of 0.5 
mL/minute. The protein concentration is estimated by 
combining a 10 UL sample with 10 UL of 0.2% Triton X-10 0 and 
using that mixture in a Bio-Rad colorimetric protein assay 
following the manufacture's instructions. The concentration 
is then adjusted by dilution with TBS to 2 0 uM. 

To define size distribution, a Coulter N4 submicron 
particle analyzer is used according to the manufacturers 
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protocol. An independent assessment of size distribution 
and mean particle size is accomplished by analytical SEC by 
injecting 2 0 uL onto an analytical SEC column with a running 
buffer TBS at a flow rate of 1.0 mL/minute (G2 0 00 SKXL , 5 x 300 
mm, 5 [im; ToSo-Haas) , fitted with a UV detector monitoring 
at 280 nm, and chilled to 12°C. A single peak eluting at 
8.23 minutes post -injection identified the retention time of 
the HP/AP-PL's which is compared to standard curve of 
molecular size marker elution times. The resultant mean 
apparent size is comparable to that of a 940,000 dalton 
protein. 

Control AP-PLs for use as screening controls are made 
to meet the following criteria: (1) the lipid composition of 
the PL was defined and characterized as having no more than 
six discrete lipid entities making up 90% of the lipid 
content of the PL; (2) 9 0% of the HP- PL preparation has a 
defined, unimodal size distribution of particles spanning no 
more than three orders of magnitude and wherein mean 
particle size is in the range of 5-10000 nm; (3) the PL 
preparation is designed to exactly mimic the analogous 
HP/AP-PL except no target HP is present. 

To prepare AP-PL bearing m2 mAChR, the following 
procedure is used. Synthetic lipids (Avanti Polar Lipids, 
Alabaster, AL) D-ribo-phytospingosine-1 -phosphate and 
ceramide-C18 : 0 each in chloroform are combined in a volume 
of 1 mL each at 5 mM in a glass test tube and the chloroform 
is evaporated at room temperature under a stream of argon 
for 24 hours. The lipid residue is then wetted with 0.4 0 mL 
of TBS plus 2 mM MgCl 2 (TBSM) , sonicated for 3 0 minutes at 
28 C, and mixture transferred to a 1.5 mL polypropylene 
tube. Separately, 50 |J,M G« ilPlY2 was prepared by combining 
equal volumes of 100 [iM G &il with 100 UM G PlY2 purified as 
described Hou, Y. et al . , J. Biol. Chem. (2000) 275:38961-6 
and dialyzed into TBSM plus 800 mM DbM (TBSM-AG) . To the 
lipid mixture is added 0 . 2 mL TBS - AG and 0.2 mL 50 \xM G a . p Y . 
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To this mixture is added 10 (XL of 50 mM GDP. This yields a 
crude mixture of lipid and AP. The protein concentration is 
estimated by combining a 10 |iL sample with 10 |iL of 0.2% 
Triton X-100 and using that mixture in a Bio-Rad 
colorimetric protein assay following the instructions of the 
manufacturer. The concentration is then adjusted by 
dilution with TBS to 10 |iM. 

This crude AP- lipid mixture is then incubated for 3 
hours at 8°C with 5 minute, 22°C bath sonication at 30 minute 
intervals (6 times) . This mixture is then subjected to 11 
passages through a 100 nm polycarbonate membranes in a small 
volume extruder according to the manufacturer's protocol 
(Avanti Polar Lipids, Alabaster, AL) at 20°C. This yields a 
crude PL preparation of 0.8 mL volume, 12 |iM mAChR, 12 |J.M 
G «iiPiY 2 ' and excess amphiphile (DbM and lipid) . To remove 
excess lipid and detergent, the crude PL preparation is 
passed through a 10.0 mL desalting (G-50 sephadex column) 
previously equilibrated with TBS at 4°C. After application 
of the crude PL preparation to the desalting column, TBS at 
4°C is used to elute the material from the column. This 
desalting procedure is conducted with a flow rate of 0.5 
mL/minute . 

To define size distribution, a Coulter N4 submicron 
particle analyzer is used according to the manufacturers 
protocol. An independent assessment of size distribution 
and mean particle size is accomplished by analytical SEC by 
injecting 2 0 |iL onto an analytical SEC column with a running 
buffer TBS at a flow rate of 1.0 mL/minute (G2000 SWXL , 5 x 300 
mm, 5 |J,m; ToSo-Haas) , fitted with a UV detector monitoring 
at 280 nm, and chilled to 12°C. A single peak eluting at 
8.23 minutes identified the retention time of the AP-PL's 
which is compared to standard curve of molecular size marker 
elution times. The resultant mean apparent size is 
comparable to that of a 940,000 dalton protein. 

66 

Express Mail Label No.: EM300819915US 
Date of Deposit: December 19, 2001 



PATENT 

CASE NO. 111025.173; NG-016US 

The proteoliposomes described herein are then screened 
for ligand binding according to any of the methods utilizing 
affinity selection and ALIS or modified ALIS presented 
herein. 

EXAMPLE 12: DUAL EPITOPE AFFINITY PURIFICATION OF HPs 

The construction of nucleic acid sequences encoding the 
tagged HP proteins of the invention is well within the skill 
of those in the art, utilizing routing procedures common in 
the art of nucleic acid cloning. 

Tagged-HPs, e.g., Mellitin-Flag Tag-Human ml mAChR-EE- 
tag protein or Mellitin-Flag Tag-Human Beta 2 Adrenergic 
Receptor-EE- tag protein, are produced from a recombinant 
baculovirus following the methodology provided by the 
manufacturer of the viral expression system (Pharmingen, San 
Diego, CA) . 

Briefly, recombinant virus is selected based on its 
ability to direct the expression of the recombinant 
protein (s). Protein expression is confirmed by western blot 
analysis of detergent-solubilized cell lysates of the 
infected insect cells. Using primary antibodies that 

recognize either the Flag epitope, the EE epitope, or the HP 
protein itself, the western blot reveals whether or not, and 
to what degree, the HP proteins are expressed. If possible, 
a functional assay is performed to determine that the 
expressed protein is functional. For example, for the 
Mellitin-Flag Tag-Human ml mAChR-EE-tag protein, a cell- 
based 3 H-N-methylscapolamine binding analyses is performed 
by the method of Rinken and Haga (Rinken, A. and Haga, T. 
(1993) Arch. Biocehm. Biophys . 301 : 158-164) confirmed that 
the virus-directed protein expression was functional, 
indicating a Bmax of 0.5xl0 6 receptors per cell and a Kd of 
0.10 nM. This baculovirus was then amplified by to a high 
titer of 2.0x10 s pfu/mL by conventional methods (Pharmingen, 
San Diego, CA) . 
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Next, large-scale insect cell cultures are obtained for 
the isolation of the proteins. Briefly, once high titer 
virus stocks are generated the constructs of interest, ten 
10 liter production runs were executed, growing Sf21 insect 
cells to a density of 2xl0 6 cells/mL in a bioreactor with 
wave agitation (Wave Biotechnology, Bedminster, NJ) . For 
each protein produced, these cells were inoculated with high 
titer virus stock at an multiplicity of infection (MOD of 5 
and two days post -infection the cells were harvested. 

The HP proteins are then solubilzed. For example, 
mAChR protein expressing cells are collected by 
centrifugation for 30 minutes at 10, 000 x g. The cell 
pellets from all ten bioreactor production runs are 
combined, and the cell pellets are suspended in 500 mL of 



p 15 ice cold TBS buffer plus 1 mM EDTA, 10 |ig/ml pepstatin and 1 

O 

uj! M-g/ml pmethylsulf onyl fluoride , and 2 0 mM dodecyl-p-D- 

maltoside (DbM) . This cell slurry is subjected to 50 
if!J strokes in a pestle A dounce homogenizer at 4°C to break 

open the cells and solubilize the membrane proteins. To 
20 clarify this suspension, the material is centrifuged for 60 
!h minutes at 40000xg at 4°C to remove insoluble material. The 

supernatant is then collected and used as a source of 
soluble Mellitin-Flag Tag-Human ml mAChR-EE protein. (Note: 
The honey bee mellitin signal sequence is cleaved off by 
25 cellular proteases in the process of expression and 
trafficking to the plasma membrane of the insect cell. From 
this point on the solubilized protein is referred to as Flag 
Tag-Human ml mAChR-EE.) 

Each solubilized recombinant HP is then purified with 
30 two rounds of affinity purification over an antibody 
affinity column. Both the anti-EE-tag and anti-Flag-tag 
columns are prepared similarly. Briefly, 35 mg of purified 
anti-epitope antibody is dialyzed into coupling buffer and 
coupled to 10 mL CNBr-act ivated sepharose beads according 
35 the manufacturers protocol (Amersham Pharmacia, Piscataway, 
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NJ) . The resin is then transferred to a 1.2 x 15 cm 
polypropylene column, and the epitope affinity columns are 
then equilibrated with TBS -AG at 4°C. 

Next, 50 mL of solubilized HP protein, e.g., FlagTag- 
Human ml mAChR-EE, is applied to the ant i -EE affinity column 
at a flow rate of 0.2 mL per minute. The column was then 
washed with 5 column volumes of TBS -AG at the same flow 
rate. The specifically-bound HP is then eluted from the 
column with a bolus of excess EE peptide (NH2 -EEEEYMPME- 
COOH; Sigma-Genosys, St. Louis, MO) solubilized in a volume 
of 15 mL in TBS -AG at 10 mM. The eluant is then applied 
similarly to the anti-Flag-tag affinity column, similarly 
washed, and eluted from the column with excess anti-FLAG 
peptide (NH2-DYKDDDDK-COOH; Sigma-Genosys, St. Louis, MO) . 
This material is then dialyzed with two exchanges into a 100 
fold volume of TBS -AG overnight at 4°C. 

Sucrose gradient ultracentrif ugation is then used to 
remove misfolded polypeptide. Briefly, the material is 
applied to a discontinuous step gradient of 5%-25% sucrose 
in TBS -AG and centrifuged in a Beckman SW Ti.50 rotor at 
lOOOOOxg for 4 hours at 4°C. The properly conformed HP, 
e.g., FlagTag-Human ml mAChR-EE, material is collected at 
the 5%-25% interface. This material is then subjected to a 
final dialysis with two exchanges into a 10 0 fold volume of 
TBS -AG overnight at 4°C . 

The protein concentration of the HP solution is 
determined by colorimetric protein assay (Bio-Rad 
Laboratories, Inc., Hercules, CA) and adjusted as necessary, 
by dilution or concentration in a stirred ultrafiltration 
cell (Millipore, Bedford, MA), to 50 |iM. The detergent 
concentration is also determined by RPC-UV using a Higgins 
C-18 column whereupon 5 0 [XL of the mixture the ligand is 
eluted into a high-resolution mass spectrometer for analysis 
using a gradient of 5%-95% acetonitrile (0.1% formic acid 
counterion) in water (w/ 0.1% formic acid) over 2 0 minutes. 
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The Rt of the DbM is determined by a separate control 
experiments under identical conditions. To quantitate the 
amount of detergent, the area under the DbM peak from the 
protein sample is compared to a standard curve generated 
from multiple RPC runs with identical conditions assaying 
the DbM peak heighth for DbM samples of known concentration. 
If the detergent concentration is estimated to be below 0.4 8 
mM (4x cmc) , it is adjusted with the addition of a small 
amount of concentrated DbM in TBS. 

EXAMPLE 13: EPITOPE AFFINITY AND METAL CHELATE AFFINITY 

CHROMATOGRAPHY (MCAC) OF HPs 

HPs are also constructed with an epitope affinity tag 

and a metal chelate affinity tag, as represented, for 

example, by Hemagglutinin SS-Rat m3 mAChR-HSV-OctaHis . 

Briefly, an isolated nucleic acid molecule encoding 

Hemagglutinin SS-Rat m3 mAChR-HSV-OctaHis is used in the 

production of a recombinant baculovirus following methods 

provided by the manufacturer (Pharmingen, San Diego, CA) . 

Virus production, the large-scale insect cell culture and 

expression of protein, the preparation of epitope affinity 

columns, the preparation of solubilized HP, and epitope 

affinity purification were all performed as previosly 

described herein except that anti-HSV antibody replaces the 

anti-FLAG antibody in the preparation of the HSV epitope 

affinity column and the HSV peptide (NH2 -QPELAPEDPED-COOH; 

Sigma -Genosys) is used to elute the Hemagglutinin SS-Rat m3 

mAChR-HSV-OctaHis protein from the column instead of the 

FLAG peptide. Also, since there is no EE epitope on the 

Hemagglutinin SS-Rat m3 mAChR-HSV-OctaHis protein, no 

epitope affinity purification using the EE epitope is used. 

(Note: The Hemagglutinin SS signal sequence may be cleaved 

off in some cell lines by cellular proteases in the process 

of expression and trafficking to the plasma membrane of the 
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insect cell. From this point on the solubilized protein is 
referred to as Rat m3 mAChR-HSV-OctaHis . 

For the final affinity purification step, Rat m3 mAChR- 
HSV-OctaHis is applied, at a flow rate of 0.2 mL/minutes, to 
a MCAC column prepared by loading 10 mL of Ni-NTA resin 
(Qiagen) into a 1.2 x 15 cm polypropylene column and 
equilibrating the column with 100 mL of TBS - AG at 4°C at a 
flow rate of 0.2 mL/minutes. Once the Rat m3 mAChR-HSV- 
OctaHis is bound to the column, the column is washed with 
100 mL of TBS -AG containing 5 raM imidazole. Then, the 
column is developed with a 50 mL linear gradient of 5 mM-350 
mM imidazole in TBS -AG at the same flow rate, collecting 0.5 
mL fractions. Fractions containing active mAChR, as 

assessed by a known radioligand binding assay, elute at 
imidazole concentrations of 190-240 mM in a volume of 11 mL. 
This material is then dialyzed with three exchanges into a 
100 fold volume of TBS -AG overnight at 4°C to remove excess 
imidazole. The material is then subjected to sucrose 
gradient ultracentrif ugation and the concentrations of 
protein and detergent are estimated and adjusted as 
previously described herein. 

EXAMPLE 14 : CHARACTERIZATION OF PURIFIED HPs 

To confirm the purity of the final preparation of the 
HP(s) produced herein, each sample is subjected to SDS-PAGE 
analysis on 5-12% Novex (Invitrogen, San Diego, CA) gels 
according to the manufacturer's instructions. Ten |ag of 
sample are loaded per gel lane, and the protein samples are 
visualized by silver staining (Peterson, G.L. et al . (1995) 
J. Biol. Chem. 270: 17808). The specific activities of the 
mAChR proteins prepared according to these methods are 
determined according the method of Peterson, G.L. et al . 
(Peterson, G.L. et al . (1995) J. Biol. Chem. 270: 17808). 
For the methods described herein, typically the specific 



BOSTON 1318564V1 



71 



Express Mail Label No.: EM300819915US 
Date of Deposit: December 19, 2001 



PATENT 

CASE NO. 111025.173; NG-016US 

activity of the proteins is determined to be in the range of 
11-16 nmol of specific ligand binding per mg mAChR protein. 

Ligand affinity chromatography is another measure by 
which the HPs of the invention may be evaluated for specific 
activity. For example, the mAChR purified by the methods 
described herein may be subjected to known ligand affinity 
chromatography over a column of immobilized mAChR ligand 3- 
(2' -aminobenzhydryloxy) -tropane (ABT) . 
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EQUIVALENTS 

Those skilled in the art will recognize, or be able to 
ascertain, using no more than routine experimentation, many 
equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be 
encompassed by the following claims. 
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